HDAC9 POLYPEPTIDES AND POLYNUCLEOTIDES AND USES THEREOF

RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional Application No.
60/298,173, filed on June 14, 2001, U.S. Provisional Application No. 60/311,686,
filed on August 10, 2001, and U.S. Provisional Application No. 60/316,995, filed on
September 4, 2001. The entire teachings of the above applications are incorporated
herein by reference.



GOVERNMENT SUPPORT

The invention was supported, in whole or in part, by grant CA-0974823 from 
the National Cancer Institute. The Government has certain rights in the invention. 



BACKGROUND OF THE INVENTION 
The N-terminal tails of core histones are subject to covalent post-translational
modifications, including acetylation and phosphorylation. Evidence
suggests that these covalent modifications play important roles in several biological
activities involving chromatin, e.g., transcription and replication. Histone
deacetylases (HDACs) catalyze the removal of the acetyl group from the lysine
residues in the N-terminal tails of nucleosomal core histones, resulting in a more
compact chromatin structure, a configuration that is generally associated with
repression of transcription.

Five proteins and/or open reading frames in yeast (RPD3, HDA1, HOS1,
HOS2 and HOS3) that share significant homology in the catalytic domain have been
identified as HDACs based upon their sequence homology to human HDAC1. To
date, eight HDACs have been identified in mammalian cells, and classified into two
classes based on their structure and similarity to yeast RPD3 or HDA1 proteins.
Recently, Sir2 family proteins that are structurally unrelated to the five proteins
aforementioned have been identified as NAD-dependent HDACs. Class I HDACs
are the yeast RPD3 homologs HDAC1, 2, 3, and 8, and are composed primarily of a
catalytic domain. Class II HDACs are the yeast HDA1 homologs HDAC4, 5, 6, and
7. HDAC4, 5, and 7 contain a long non-catalytic N-terminal end and a C-terminal
HDAC catalytic domain, while HDAC6 has two HDAC catalytic domains.

It has also been determined that histone deacetylases can be sensitive to
small molecules, including trichostatin A (TSA), trapoxin, and butyrate. For
example, the yeast RPD3 and HDA1 and mammalian HDAC1, 2, 3, 4, 5, 6, 7 and 8
are sensitive to inhibition by TSA. The Sir2 family HDACs, yeast
HOS3 and Drosophila melanogaster dHDAC6, however, appear to be relatively
insensitive to TSA. A class of hybrid bipolar compounds, such as suberoylanilide
hydroxamic acid (SAHA), have also been shown to inhibit histone deacetylases and
induce terminal differentiation and/or apoptosis in various transformed cells.
Examples of such compounds can be found in U.S. Patent Nos. 5,369,108, issued on
November 29, 1994, 5,700,811, issued on December 23, 1997, and 5,773,474, issued
on June 30, 1998 to Breslow et al., as well as U.S. Patent Nos. 5,055,608, issued on
October 8, 1991, and 5,175,191, issued on December 29, 1992 to Marks et al., the
entire content of all of which are hereby incorporated by reference.

The identification of the mechanisms by which histones are deacetylated, and 
the characterization of histone deacetylase function would be of great benefit in 
understanding how gene transcription is controlled, how the cell cycle is regulated, 
and how cells are signaled to undergo terminal differentiation and/or apoptosis.
Elucidation of such mechanisms can lead to improved therapeutics for many
diseases, in particular those characterized by cell proliferation or a lack of cell 
differentiation or apoptosis, for example, cancer. 

SUMMARY OF THE INVENTION 
The present invention relates to isolated or recombinant histone deacetylase
polypeptides, and isolated histone deacetylase nucleic acid molecules encoding those
polypeptides, as well as vectors and cells containing those isolated nucleic acid 
molecules. 

In one aspect of the invention, the isolated or recombinant histone
deacetylase polypeptide is selected from a) an isolated or recombinant polypeptide
comprising SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ
ID NO: 10; and b) a polypeptide having at least 60% sequence identity with any one
of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO:
10. In one embodiment, the isolated or recombinant histone deacetylase polypeptide
consists of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ
ID NO: 10. In another embodiment, the isolated or recombinant histone deacetylase
polypeptide is mammalian; preferably, the isolated or recombinant histone
deacetylase polypeptide is human.

In another aspect, the invention features an isolated nucleic acid molecule
selected from a) an isolated nucleic acid comprising SEQ ID NO: 1, SEQ ID NO: 3,
SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9; b) a complement of an isolated
nucleic acid comprising SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID
NO: 7, or SEQ ID NO: 9; c) an isolated nucleic acid encoding a histone deacetylase
polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or
SEQ ID NO: 10; d) a complement of an isolated nucleic acid encoding a histone
deacetylase polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID
NO: 8, or SEQ ID NO: 10; e) a nucleic acid that is hybridizeable under high
stringency conditions to a nucleic acid molecule that encodes any of SEQ ID NO: 2,
SEQ ID NO: 4, SEQ ID NO: 6, or SEQ ID NO: 8, or a complement thereof; or f) a
nucleic acid molecule that is hybridizeable under high stringency conditions to a
nucleic acid comprising SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, or SEQ ID
NO: 7; and g) an isolated nucleic acid molecule that has at least 55% sequence
identity with any one of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID
NO: 7, SEQ ID NO: 9, or a complement thereof. In one embodiment, the isolated
nucleic acid molecule consists of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5,
SEQ ID NO: 7, or SEQ ID NO: 9. In another embodiment, the isolated nucleic acid
molecule is mammalian; preferably, the isolated nucleic acid molecule is human.

In other aspects, the invention features a vector comprising the isolated 
histone deacetylase nucleic acid molecule described above, a cell comprising the 
vector, and a cell comprising the isolated histone deacetylase nucleic acid molecule 
described above. 

In another aspect, the invention features a purified antibody that selectively
binds a histone deacetylase polypeptide described above.

In yet another aspect, the invention features a method of identifying a
compound that modulates expression of a histone deacetylase nucleic acid molecule
described above. The method comprises the steps of a) contacting the nucleic acid
molecule with a candidate compound under conditions suitable for expression; and
b) assessing the level of expression of the nucleic acid molecule. A candidate
compound that increases or decreases expression of the nucleic acid molecule
relative to a control is a compound that modulates expression of the nucleic acid
molecule. In one embodiment, the method is carried out in a cell or animal. In
another embodiment, the method is carried out in a cell free system.

The invention also features a method of treating a cell proliferation disease,
an apoptotic disease, or a cell differentiation disease, for example, cancers such as
lymphoma, leukemia, melanoma, ovarian cancer, breast cancer, pancreatic cancer,
prostate cancer, colon cancer, and lung cancer, and myeloproliferative disorders,
including polycythemia vera, essential thrombocythemia, agnogenic myeloid
metaplasia, and chronic myelogenous leukemia, in an individual, comprising
administering a compound identified by the above method.

In still another aspect, the invention features a method of identifying a
compound that modulates the enzymatic activity of the histone deacetylase
polypeptide described above. The method comprises the steps of a) contacting the
polypeptide with a candidate compound under conditions suitable for enzymatic
reaction; and b) assessing the activity level of the polypeptide. A candidate
compound that increases or decreases the activity level of the polypeptide relative to
a control is a compound that modulates the enzymatic activity of the polypeptide. In
one embodiment, the method is carried out in a cell or animal. In another
embodiment, the method is carried out in a cell free system.

In yet another embodiment, the polypeptide is further contacted with a
substrate for the polypeptide, wherein the substrate is selected from the group
consisting of a cell proliferation disease binding agent, an apoptotic disease binding
agent, and a cell differentiation disease binding agent. In one embodiment, the
candidate compound is an inhibitor. In another embodiment, the candidate compound is
an activator.

In another aspect, the invention features a method of identifying a compound
that modulates the transcriptional repression activity of the histone deacetylase
polypeptide described above. The method comprises the steps of a) contacting the
polypeptide with a candidate compound under conditions suitable for a
transcriptional repression reaction; and b) assessing the transcriptional repression
activity level of the polypeptide. A candidate compound that increases or decreases
the transcriptional repression activity level of the polypeptide relative to a control is
a compound that modulates the transcriptional repression activity of the polypeptide.
In one embodiment, the method is carried out in a cell or animal. In another
embodiment, the method is carried out in a cell free system.

In yet another embodiment, the polypeptide is further contacted with a
substrate for the polypeptide, wherein the substrate is selected from the group
consisting of a cell proliferation disease binding agent, an apoptotic disease binding
agent, and a cell differentiation disease binding agent. In one embodiment, the
candidate compound is an inhibitor. In another embodiment, the candidate compound is
an activator.

In another aspect, the invention features a method of identifying a compound
that modulates expression of a histone deacetylase nucleic acid molecule described
above. The method comprises the steps of a) providing a nucleic acid molecule
comprising a promoter region of the histone deacetylase nucleic acid molecule
described above, or part of such a promoter region, operably linked to a reporter
gene; b) contacting the nucleic acid molecule with a candidate compound; and c)
assessing the level of the reporter gene. A candidate compound that increases or
decreases expression of the reporter gene relative to a control is a compound that
modulates expression of the histone deacetylase nucleic acid molecule described
above. In one embodiment, the method is carried out in a cell.

In still another aspect, the invention features a method of identifying a
polypeptide that interacts with a histone deacetylase polypeptide described above in
a yeast two-hybrid system. The method comprises the steps of a) providing a first
nucleic acid vector comprising a nucleic acid molecule encoding a DNA binding
domain and the histone deacetylase polypeptide described above; b) providing a
second nucleic acid vector comprising a nucleic acid encoding a transcription
activation domain and a nucleic acid encoding a test polypeptide; c) contacting the
first nucleic acid vector with the second nucleic acid vector in a yeast two-hybrid
system; and d) assessing transcriptional activation in the yeast two-hybrid system.
An increase in transcriptional activation relative to a control indicates that the test
polypeptide is a polypeptide that interacts with the histone deacetylase polypeptide
described above.

The invention also features a pharmaceutical composition comprising a 
histone deacetylase polypeptide described above. 

In addition, the present invention features a method of diagnosing a cell
proliferation disease, an apoptotic disease, or a cell differentiation disease in a
subject. The method comprises the steps of a) obtaining a sample from the subject;
and b) assessing the level of activity or expression of the histone deacetylase
polypeptide described above or the level of the nucleic acid molecule described
above in the sample. If the level is increased relative to a control, then the subject
has an increased likelihood of having a cell proliferation disease, an apoptotic
disease, or a cell differentiation disease, and if the level is decreased relative to a
control, then the subject has a decreased likelihood of having a cell proliferation
disease, an apoptotic disease, or a cell differentiation disease. In one embodiment,
the polypeptide level is assayed using immunohistochemistry techniques. In another
embodiment, the nucleic acid molecule level is assayed using in situ hybridization
techniques.

Compounds and/or polypeptides identified in the above-described screening 
methods are also part of the present invention. 



DESCRIPTION OF THE FIGURES

FIG. 1 is a schematic representation of the order in which FIGS. 1A-1O
should be viewed.

FIGS. 1A-1C show the cDNA sequence of HDAC9 (SEQ ID NO: 1). The
arrows and numbers in the HDAC9 sequence indicate exons. The boxed portion of
the sequence indicates the HDAC domain.

FIGS. 1D-1G show the cDNA sequence of HDAC9a (SEQ ID NO: 3). The
arrows and numbers in the HDAC9a sequence indicate exons. The boxed portion of
the sequence indicates the HDAC domain.

FIGS. 1H-1I show the cDNA sequence of HDRP(ΔNLS) (SEQ ID NO: 9).

FIGS. 1J-1L show the cDNA sequence of HDAC9(ΔNLS) (SEQ ID NO: 5).

FIGS. 1M-1O show the cDNA sequence of HDAC9a(ΔNLS) (SEQ ID NO: 7).

FIG. 2 is a schematic representation of the order in which FIGS. 2A-2E 
should be viewed. 

FIG. 2A shows the amino acid sequence of HDAC9 (SEQ ID NO: 2).

FIG. 2B shows the amino acid sequence of HDAC9a (SEQ ID NO: 4).

FIG. 2C shows the amino acid sequence of HDAC9(ΔNLS) (SEQ ID NO: 6).

FIG. 2D shows the amino acid sequence of HDAC9a(ΔNLS) (SEQ ID NO: 8).

FIG. 2E shows the amino acid sequence of HDRP(ΔNLS) (SEQ ID NO: 10).

FIG. 3 is a schematic representation of the order in which FIGS. 3A-3C 
should be viewed. 

FIGS. 3A-3C show an amino acid sequence alignment of HDRP (SEQ ID
NO: 11), HDAC9 (SEQ ID NO: 2), HDAC9a (SEQ ID NO: 4), and HDAC4 (SEQ
ID NO: 12) polypeptides. Amino acid sequences of HDAC9 (GenBank Accession:
AY032737; SEQ ID NO: 2) and HDAC9a (GenBank Accession: AY032738; SEQ
ID NO: 4) are aligned with HDRP (GenBank Accession: BAA34464; SEQ ID NO:
11) and HDAC4 (GenBank Accession: NP_006028; SEQ ID NO: 12). The identical
residues in all proteins are boxed with solid lines. The similar residues are boxed
with dotted lines.

FIG. 4 shows a schematic representation of the human HDAC9 gene
structure. The striped boxes represent exons present in isoforms HDRP, HDAC9a,
and HDAC9. The lines represent introns. Broken lines are used for larger introns
(with size in base pairs shown on top). The 5' untranslated region cDNA and coding
region cDNA are represented here. Exons 1-12 encode a non-catalytic domain of the
polypeptides, and exons 14-21 encode the histone deacetylase catalytic domain of
the polypeptides, which provides the polypeptides with deacetylase activity.

FIG. 5 is a schematic representation of the order in which FIGS. 5A-5D 
should be viewed. 

FIGS. 5A-5D show the nucleic acid sequence of HDAC9, containing all
exons expressed in the various isoforms of HDAC9, HDAC9a, HDAC9(ΔNLS),
HDAC9a(ΔNLS), and HDRP(ΔNLS) of the present invention (SEQ ID NO: 13).

FIG. 6A is a scanned image of a multiple human tissue Northern blot that
was probed to determine mRNA expression of HDAC9 using a cDNA probe that
recognizes both HDAC9 and HDAC9a. The tissues examined are lane 1, heart; lane
2, brain; lane 3, placenta; lane 4, lung; lane 5, liver; lane 6, skeletal muscle; lane 7,
kidney; and lane 8, pancreas. Positions of the RNA size marker in kilobases (kb) are
indicated to the left of the blot.

FIG. 6B is a scanned image of an electrophoretic gel showing the results of
RT-PCR analyses of mRNA from the same tissues as examined in the Northern blot
of FIG. 6A to determine the distribution of HDAC9 and HDAC9a mRNA among
these tissues. PCR products were resolved by agarose gel electrophoresis and
visualized by ethidium bromide under UV light. A 1-kb DNA ladder was run on
both sides of the gel with the size (in kb) indicated on the left. On the right side, the
expected products for HDAC9 and HDAC9a are indicated as 9 and 9a, respectively.

FIG. 7 is a graph of HDAC enzymatic activity of HDAC anti-FLAG-immunoprecipitated
proteins isolated from vector control, HDAC9-FLAG, and
HDAC9a-FLAG transfected 293T cells, as measured in fluorescence units using
FLUOR DE LYS™ as a substrate in the presence or absence of 1 µM TSA. Results
are shown as the mean of three independent assays. The inset is a scanned image of
an anti-FLAG Western blot showing the amount of proteins used in the assay. V,
Vector control; 9, HDAC9-FLAG; and 9a, HDAC9a-FLAG.

FIG. 8 is a graph of HDAC enzymatic activity of HDAC anti-FLAG-immunoprecipitated
proteins isolated from vector control, and HDAC9a-FLAG
(treated with 2 µM SAHA or left untreated) transfected 293T cells, as measured by
3H-acetic acid released from 3H-histones in the presence or absence of 2 µM SAHA.
Vector control; HDAC9a, HDAC9a-FLAG; and HDAC9a+, HDAC9a-FLAG +
SAHA.

FIG. 9A shows a scanned image of a Western blot of 293T whole cell lysate
and anti-FLAG immunoprecipitates from 293T cells transfected with vector,
HDAC9-FLAG or HDAC9a-FLAG using antibodies against MEF2 and FLAG. Top
panel, anti-MEF2 Western; bottom panel, anti-FLAG Western. L, 293T whole cell
lysate; V, vector control IP; 9, HDAC9-FLAG IP; 9a, HDAC9a-FLAG IP.

FIG. 9B is a graph showing the transcription level of p3XMEF2-luc in the
presence or absence of pcDNA3 empty vector (-), pCMV-MEF2C, and/or a vector
encoding pFLAG-HDAC9 or pFLAG-HDAC9a. p3XMEF2-luc (100 ng) and pRL-TK
(5 ng) were transfected into 293T cells with pcDNA3 empty vector (-) or with
pCMV-MEF2C (100 ng) (+) along with the indicated amount of pFLAG-HDAC9 or
pFLAG-HDAC9a. pFLAG empty vector was used to adjust the DNA to an equal
amount in each transfection. The firefly luciferase activity was first normalized to
the co-transfected Renilla luciferase activity and the value for MEF2C alone was
then set as 1. Results are shown as the mean of three independent transfections +/-
standard deviation.
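
The two-step normalization described for FIG. 9B (firefly signal divided by the co-transfected Renilla signal, then scaled so that MEF2C alone equals 1) can be sketched as follows. This is an illustrative sketch only, not the analysis actually used for the figure: the function names and the example readings are hypothetical, and the sample standard deviation is assumed for the "+/- standard deviation" value.

```python
from statistics import mean, stdev


def normalized_ratios(readings):
    """Divide each firefly reading by its co-transfected Renilla reading.

    readings: list of (firefly, renilla) pairs, one pair per independent
    transfection (hypothetical data layout).
    """
    return [firefly / renilla for firefly, renilla in readings]


def relative_activity(condition, mef2c_alone):
    """Return (mean, sample SD) of a condition relative to MEF2C alone.

    The mean firefly/Renilla ratio of the MEF2C-alone transfections is the
    reference, so MEF2C alone has a relative activity of 1 by construction.
    """
    reference = mean(normalized_ratios(mef2c_alone))
    scaled = [ratio / reference for ratio in normalized_ratios(condition)]
    return mean(scaled), stdev(scaled)


# Hypothetical luminometer readings from three independent transfections.
mef2c_only = [(1000, 100), (1200, 100), (800, 100)]
mef2c_plus_hdac9 = [(500, 100), (600, 100), (400, 100)]

mean_rel, sd_rel = relative_activity(mef2c_plus_hdac9, mef2c_only)
print(f"relative activity: {mean_rel:.2f} +/- {sd_rel:.2f}")
```

Dividing by the Renilla signal corrects for differences in transfection efficiency between wells before conditions are compared.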

FIG. 10 shows a schematic representation of the HDAC domains of human
non-Sir2 family HDACs and HDRP. The boxes represent histone deacetylase
(HDAC) domains.

FIG. 11 is a schematic representation of the order in which FIGS. 11A-11F
should be viewed.

FIGS. 11A-11F show the nucleotide sequence of the vector pFLAG-CMV-5b-HDAC9
(VR1) (SEQ ID NO: 14). Lowercase letters are vector backbone,
uppercase letters are HDAC9 sequence. "Acc" was added at the beginning of the
HDAC9 sequence for translation initiation.

FIG. 12 is a schematic representation of the order in which FIGS. 12-1 
through 12-66 should be viewed. 

FIGS. 12-1 through 12-66 show the nucleotide sequence of the vector
pFLAG-CMV-5b-HDAC9 (VR1), with restriction enzyme sites indicated (SEQ ID
NO: 14).

FIG. 13 is a schematic representation of the order in which FIGS. 13A-13E
should be viewed.

FIGS. 13A-13E show the nucleotide sequence of the vector pFLAG-CMV-5b-HDAC9a
(VR2) (SEQ ID NO: 15). Lowercase letters are vector backbone,
uppercase letters are HDAC9a sequence. "Acc" was added at the beginning of the
HDAC9a sequence for translation initiation.

FIG. 14 is a schematic representation of the order in which FIGS. 14-1 
through 14-61 should be viewed. 

FIGS. 14-1 through 14-61 show the nucleotide sequence of the vector
pFLAG-CMV-5b-HDAC9a (VR2), with restriction enzyme sites indicated (SEQ ID
NO: 15).

DETAILED DESCRIPTION OF THE INVENTION 

A protein designated HDRP (See Zhou et al., Proc. Natl. Acad. Sci. USA,
97:1056-1061 (2000)) (also called MITR (See Sparrow et al., EMBO J. 18:5085-5098
(1999); Zhang et al., J. Biol. Chem., 276:35-39 (2001); and Zhang et al., Proc.
Natl. Acad. Sci. USA, 98:7354-7359 (2001))) that is 50% identical to the N-terminal
domains of histone deacetylase 4 (HDAC4) and histone deacetylase 5 (HDAC5) was
recently identified. The cloning and characterization of a novel histone deacetylase,
HDAC9, of which HDRP is an alternatively spliced isoform, is described herein. The
cDNA sequence of HDAC9 is shown in FIGS. 1A-1C (SEQ ID NO: 1), and the
HDAC9 amino acid sequence is shown in FIG. 2A (SEQ ID NO: 2). In addition to
HDAC9, other alternatively spliced isoforms were also identified: HDAC9a (a
polypeptide that is 132 amino acids shorter at the C-terminal end than
HDAC9), and isoforms of the HDAC9, HDAC9a, and HDRP polypeptides that lack the
nuclear localization signal (NLS) in the N-terminal non-catalytic end of HDAC9,
termed HDAC9(ΔNLS), HDAC9a(ΔNLS), and HDRP(ΔNLS), respectively.
The cDNA sequence of HDAC9a is shown in FIGS. 1D-1G (SEQ
ID NO: 3), and the HDAC9a amino acid sequence is shown in FIG. 2B (SEQ ID
NO: 4). The cDNA sequence of HDAC9 encoding a polypeptide lacking an NLS
(HDAC9(ΔNLS)) is shown in FIGS. 1J-1L (SEQ ID NO: 5), and the HDAC9 lacking
an NLS amino acid sequence is shown in FIG. 2C (SEQ ID NO: 6). The cDNA
sequence of HDAC9a encoding a polypeptide lacking an NLS (HDAC9a(ΔNLS)) is
shown in FIGS. 1M-1O (SEQ ID NO: 7), and the HDAC9a lacking an NLS amino
acid sequence is shown in FIG. 2D (SEQ ID NO: 8). The cDNA sequence of HDRP
encoding a polypeptide lacking an NLS (HDRP(ΔNLS)) is shown in FIGS. 1H-1I
(SEQ ID NO: 9), and the HDRP lacking an NLS amino acid sequence is shown in
FIG. 2E (SEQ ID NO: 10).

POLYPEPTIDES OF THE INVENTION 

The present invention features isolated or recombinant HDAC9 polypeptides,
HDAC9a polypeptides, HDAC9(ΔNLS) polypeptides, HDAC9a(ΔNLS)
polypeptides, and HDRP(ΔNLS) polypeptides, and fragments, derivatives, and
variants thereof, as well as polypeptides encoded by nucleotide sequences described
herein (e.g., other variants). As used herein, the term "polypeptide" refers to a
polymer of amino acids, and not to a specific length; thus, peptides, oligopeptides,
and proteins are included within the definition of a polypeptide.

As used herein, a polypeptide is said to be "isolated," "substantially pure," or
"substantially pure and isolated" when it is substantially free of cellular material,
when it is isolated from recombinant or non-recombinant cells, or free of chemical
precursors or other chemicals when it is chemically synthesized. Typically, the
HDAC9, HDAC9a, HDAC9(ΔNLS), HDAC9a(ΔNLS), or HDRP(ΔNLS)
polypeptide is isolated, substantially pure, or substantially pure and isolated when it
has a relative increased concentration or activity of HDAC9, HDAC9a,
HDAC9(ΔNLS), HDAC9a(ΔNLS), or HDRP(ΔNLS), in comparison to total HDAC
concentration or activity. Preferably the increased activity or concentration of the
HDAC9, HDAC9a, HDAC9(ΔNLS), HDAC9a(ΔNLS), or HDRP(ΔNLS) is at least
2-fold, more preferably, at least 5-fold, and most preferably, at least 10-fold, in
comparison to total HDAC concentration or activity. In addition, a polypeptide can
be joined to another polypeptide with which it is not normally associated in a cell
(e.g., in a "fusion protein") and still be "isolated," "substantially pure," or
"substantially pure and isolated." An isolated, substantially pure, or substantially
pure and isolated polypeptide may be obtained, for example, using affinity
purification techniques described herein, as well as other techniques described herein
and known to those skilled in the art.

By a "histone deacetylase polypeptide" is meant a polypeptide having histone
deacetylase activity, transcription repression activity, and/or the ability to deacetylate
other substrates, for example, transcription factors, including p53, CoRest, E2F,
GATA-1, TFIIE, and TFIIF, that normally have a nuclear or cytoplasmic location in a
cell. A histone deacetylase polypeptide is also a polypeptide whose activity can be
inhibited by molecules having HDAC inhibitory activity. These molecules fall into
four general classes: 1) short-chain fatty acids (e.g., 4-phenylbutyrate and valproic
acid); 2) hydroxamic acids (e.g., SAHA, Pyroxamide, trichostatin A (TSA),
oxamflatin and CHAPs, such as CHAP1 and CHAP31); 3) cyclic tetrapeptides
(Trapoxin A, Apicidin and Depsipeptide (FK-228, also known as FR901228)); 4)
benzamides (e.g., MS-275); and other compounds such as Scriptaid. Examples of
such compounds can be found in U.S. Patent Nos. 5,369,108, issued on November
29, 1994, 5,700,811, issued on December 23, 1997, and 5,773,474, issued on June
30, 1998 to Breslow et al., U.S. Patent Nos. 5,055,608, issued on October 8, 1991,
and 5,175,191, issued on December 29, 1992 to Marks et al., as well as Yoshida et
al., Bioessays 17, 423-430 (1995), Saito et al., PNAS USA 96, 4592-4597 (1999),
Furumai et al., PNAS USA 98 (1), 87-92 (2001), Komatsu et al., Cancer Res.
61(11), 4459-4466 (2001), Su et al., Cancer Res. 60, 3137-3142 (2000), Lee et al.,
Cancer Res. 61(3), 931-934 and Suzuki et al., J. Med. Chem. 42(15), 3001-3003
(1999), the entire content of all of which are hereby incorporated by reference.
Examples of such histone deacetylase polypeptides include HDAC9, HDAC9a,
HDAC9(ΔNLS), HDAC9a(ΔNLS), HDRP(ΔNLS); a substantially pure polypeptide
comprising SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ
ID NO: 10; and a polypeptide having preferably at least 60%, more preferably, 70%,
75%, 80%, 85%, or 90%, and most preferably, 95% sequence identity to any one of
SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10,
as determined using the BLAST program and parameters described herein.
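
The tiered identity thresholds above can be illustrated with a short sketch. It does not reproduce BLAST or its parameters: it assumes the two sequences have already been aligned (gaps written as "-") and uses one common convention, identical positions divided by alignment length. The function names and the toy peptide strings are hypothetical and are not taken from the sequence listing.

```python
# Identity tiers recited in the text, in percent.
THRESHOLDS = (60, 70, 75, 80, 85, 90, 95)


def percent_identity(aligned_a, aligned_b):
    """Percent identity over a pre-computed pairwise alignment.

    Both strings must have equal length; '-' marks a gap. A column counts
    as identical only when both residues match and neither is a gap.
    """
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("aligned sequences must be non-empty and equal length")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


def highest_threshold_met(identity):
    """Return the highest recited tier satisfied, or None if below 60%."""
    met = [tier for tier in THRESHOLDS if identity >= tier]
    return max(met) if met else None


# Toy example with hypothetical 10-column alignment (8 identical columns).
identity = percent_identity("MKTA-IAKQR", "MKTAYIAKQK")
print(identity, highest_threshold_met(identity))
```

Other conventions divide by the length of the shorter sequence or by the number of non-gap columns, so a value computed this way may differ from a BLAST-reported identity.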

In one embodiment, the histone deacetylase polypeptide has histone
deacetylase activity, transcription repression activity, the ability to deacetylate
substrates, or is inhibited by trichostatin A or a hybrid polar compound such as
SAHA. In another embodiment, the HDAC9(ΔNLS) polypeptide has any two of the
above biological activities. In still another embodiment, the HDAC9(ΔNLS)
polypeptide has any three of the above biological activities. In yet another
embodiment, the HDAC9(ΔNLS) polypeptide has all of the above biological
activities.

An HDAC9 polypeptide is a histone deacetylase polypeptide as described
above. An HDAC9 polypeptide preferably has at least 60%, more preferably, 70%,
75%, 80%, 85%, or 90%, and most preferably, 95% sequence identity to SEQ ID
NO: 2, as determined using the BLAST program and parameters described herein.
An HDAC9 polypeptide is also a polypeptide that comprises the amino acids
encoded by exons 23, 24, 25 and/or 26, and that does not comprise the amino acids
encoded by exon 13 of the HDAC9 nucleic acid sequence, as shown in FIGS. 1A-1C,
FIG. 4, and FIGS. 5A-5D. Preferably, an HDAC9 polypeptide comprises the
sequence of SEQ ID NO: 2. More preferably, an HDAC9 polypeptide consists of
the sequence of SEQ ID NO: 2. An HDAC9 polypeptide is also a polypeptide
comprising the amino acid sequence of the polypeptide encoded by the nucleic acid
sequence of SEQ ID NO: 1.

An HDAC9a polypeptide is a histone deacetylase polypeptide as described
above. An HDAC9a polypeptide preferably has at least 60%, more preferably, 70%,
75%, 80%, 85%, or 90%, and most preferably, 95% sequence identity to SEQ ID
NO: 4, as determined using the BLAST program and parameters described herein.
An HDAC9a polypeptide is also a polypeptide that comprises the amino acids
encoded by exon 22, and that does not comprise the amino acids encoded by exons
13, 23, 24, 25, or 26 of the HDAC9 nucleic acid sequence, as shown in FIGS. 1D-1G,
FIG. 4, and FIGS. 5A-5D. Preferably, an HDAC9a polypeptide comprises the
sequence of SEQ ID NO: 4. More preferably, an HDAC9a polypeptide consists of
the sequence of SEQ ID NO: 4. An HDAC9a polypeptide is also a polypeptide
comprising the amino acid sequence of the polypeptide encoded by the nucleic acid
sequence of SEQ ID NO: 3.

An HDAC9(ΔNLS) polypeptide is a histone deacetylase polypeptide as described
above. An HDAC9(ΔNLS) polypeptide does not comprise a nuclear localization signal
(NLS). An HDAC9(ΔNLS) polypeptide preferably has at least 60%, more
preferably, 70%, 75%, 80%, 85%, or 90%, and most preferably, 95% sequence
identity to SEQ ID NO: 6, as determined using the BLAST program and parameters
described herein. An HDAC9(ΔNLS) polypeptide is also a polypeptide that
comprises the amino acids encoded by exons 23, 24, 25, and/or 26, and that does not
comprise the amino acids encoded by exons 7 or 13 of the HDAC9 nucleic acid
sequence, as shown in FIGS. 1J-1L, and FIGS. 5A-5D. Preferably, an
HDAC9(ΔNLS) polypeptide comprises the sequence of SEQ ID NO: 6. More
preferably, an HDAC9(ΔNLS) polypeptide consists of the sequence of SEQ ID NO:
6. An HDAC9(ΔNLS) polypeptide is also a polypeptide comprising the amino acid
sequence of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO: 5.

An HDAC9a(ΔNLS) polypeptide is a histone deacetylase polypeptide as
described above. An HDAC9a(ΔNLS) polypeptide does not comprise a nuclear
localization signal (NLS). An HDAC9a(ΔNLS) polypeptide preferably has at least
60%, more preferably, 70%, 75%, 80%, 85%, or 90%, and most preferably, 95% sequence
identity to SEQ ID NO: 8, as determined using the BLAST program and parameters
described herein. An HDAC9a(ΔNLS) polypeptide is also a polypeptide that
comprises the amino acids encoded by exon 22, and that does not comprise the
amino acids encoded by exons 7, 13, 23, 24, 25, or 26 of the HDAC9 nucleic acid
sequence, as shown in FIGS. 1M-1O, and FIGS. 5A-5D. Preferably, an
HDAC9a(ΔNLS) polypeptide comprises the sequence of SEQ ID NO: 8. More
preferably, an HDAC9a(ΔNLS) polypeptide consists of the sequence of SEQ ID NO:
8. An HDAC9a(ΔNLS) polypeptide is also a polypeptide comprising the amino acid
sequence of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO: 7.

An HDRP(ΔNLS) polypeptide is a histone deacetylase polypeptide as
described above. An HDRP(ΔNLS) polypeptide does not comprise a nuclear
localization signal (NLS). An HDRP(ΔNLS) polypeptide preferably has at least 60%,
more preferably, 70%, 75%, 80%, 85%, or 90%, and most preferably, 95% sequence
identity to SEQ ID NO: 10, as determined using the BLAST program and parameters
described herein. An HDRP(ΔNLS) polypeptide is also a polypeptide that does not comprise
the amino acids encoded by exons 7 or 13-26 of the HDAC9 nucleic acid sequence,
as shown in FIGS. 1H-1I and FIGS. 5A-5D. Preferably, an HDRP(ΔNLS)
polypeptide comprises the sequence of SEQ ID NO: 10. More preferably, an
HDRP(ΔNLS) polypeptide consists of the sequence of SEQ ID NO: 10. An
HDRP(ΔNLS) polypeptide is also a polypeptide comprising the amino acid sequence
of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO: 9.

The polypeptides of the invention can be purified to homogeneity. It is
understood, however, that preparations in which the polypeptide is not purified to
homogeneity are useful. The critical feature is that the preparation allows for the
desired function of the polypeptide, even in the presence of considerable amounts of
other components. Thus, the invention encompasses various degrees of purity. In
one embodiment, the language "substantially free of cellular material" includes
preparations of the polypeptide having less than about 30% (by dry weight) other
proteins (i.e., contaminating protein), less than about 20% other proteins, less than 
about 10% other proteins, or less than about 5% other proteins. 

When a polypeptide is recombinantly produced, it can also be substantially 
free of culture medium, i.e., culture medium represents less than about 20%, less 
than about 10%, or less than about 5% of the volume of the polypeptide preparation.
The language "substantially free of chemical precursors or other chemicals" includes
preparations of the polypeptide in which it is separated from chemical precursors or
other chemicals that are involved in its synthesis. In one embodiment, the language
"substantially free of chemical precursors or other chemicals" includes preparations
of the polypeptide having less than about 30% (by dry weight) chemical precursors
or other chemicals, less than about 20% chemical precursors or other chemicals, less 
than about 10% chemical precursors or other chemicals, or less than about 5% 
chemical precursors or other chemicals. 

In one embodiment, a polypeptide of the invention comprises an amino acid
sequence encoded by a nucleic acid molecule comprising a nucleotide sequence
selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5,
SEQ ID NO: 7, SEQ ID NO: 9, and complements and portions thereof (e.g., a
complement of any one of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID
NO: 7, or SEQ ID NO: 9, or a portion of any one of SEQ ID NO: 1, SEQ ID NO: 3,
SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9).

The polypeptides of the invention also encompass fragments and sequence 
variants. Variants include a substantially homologous polypeptide encoded by the 
same genetic locus in an organism, i.e., an allelic variant, as well as other variants.
Variants also encompass polypeptides derived from other genetic loci in an
organism, but having substantial homology to a polypeptide encoded by a nucleic
acid molecule comprising a nucleotide sequence selected from the group consisting
of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9,
and complements and portions thereof, or having substantial homology to a
polypeptide encoded by a nucleic acid molecule comprising a nucleotide sequence
selected from the group consisting of nucleotide sequences encoding any one of SEQ
ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10.
Variants also include polypeptides substantially homologous or identical to these
polypeptides but derived from another organism, i.e., an ortholog. Variants also
include polypeptides that are substantially homologous or identical to these
polypeptides that are produced by chemical synthesis. Variants also include
polypeptides that are substantially homologous or identical to these polypeptides that
are produced by recombinant methods.

As used herein, two polypeptides (or a region of the polypeptides) are 
substantially homologous or identical when the amino acid sequences are at least 
about 60-65%, typically at least about 70-75%, more typically at least about 80-85%, 
and most typically greater than about 90-95% or more homologous or identical. A 
substantially identical or homologous amino acid sequence, according to the present
invention, will be encoded by a nucleic acid molecule hybridizing to SEQ ID NO: 1,
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, or a portion thereof,
under stringent conditions as more particularly described herein, or will be encoded
by a nucleic acid molecule hybridizing to a nucleic acid sequence encoding SEQ ID
NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, or a portion
thereof, under stringent conditions as more particularly described herein.

The percent identity of two nucleotide or amino acid sequences can be
determined by aligning the sequences for optimal comparison purposes (e.g., gaps
can be introduced in a first sequence). The nucleotides or amino
acids at corresponding positions are then compared, and the percent identity between
the two sequences is a function of the number of identical positions shared by the
sequences (i.e., % identity = (# of identical positions / total # of positions) x 100). In
certain embodiments, the length of the HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), and HDRP(ANLS) amino acid or nucleotide sequence aligned for
comparison purposes is at least 30%, preferably, at least 40%, more preferably, at
least 60%, and even more preferably, at least 70%, 80%, 90%, or 100% of the length
of the reference sequence, for example, those sequences provided in FIGS. 1A-1O
and 2A-2E. The actual comparison of the two sequences can be accomplished by
well-known methods, for example, using a mathematical algorithm. A preferred,
non-limiting example of such a mathematical algorithm is described in Karlin et al.,
Proc. Natl. Acad. Sci. USA, 90:5873-5877 (1993). Such an algorithm is
incorporated into the BLASTN and BLASTX programs (version 2.2) as described in
Schaffer et al., Nucleic Acids Res., 29:2994-3005 (2001). When utilizing BLAST
and Gapped BLAST programs, the default parameters of the respective programs
(e.g., BLASTN) can be used. See http://www.ncbi.nlm.nih.gov, as available on
August 10, 2001. In one embodiment, the database searched is a non-redundant
(NR) database, and parameters for sequence comparison can be set at: no filters;
Expect value of 10; Word Size of 3; the Matrix is BLOSUM62; and Gap Costs have
an Existence of 11 and an Extension of 1.
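
The percent identity formula given above can be illustrated with a minimal sketch. It assumes the two sequences have already been aligned (for example, by BLAST) and are supplied as equal-length strings in which "-" marks a gap; the function name and the choice to count every alignment column, including gapped columns, as a position are assumptions made for illustration and are not taken from the specification.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned, equal-length sequences.

    '-' marks a gap. Every alignment column counts toward the total
    number of positions (an assumption; other conventions exist).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    # A position is identical when both rows hold the same residue (not a gap).
    identical = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * identical / len(aligned_a)


if __name__ == "__main__":
    # Toy alignment: 4 identical positions out of 6 columns.
    print(round(percent_identity("MKV-LT", "MKVALS"), 1))
```

A value computed this way could then be compared against the thresholds recited above (for example, at least 60% or at least 95%).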

Another preferred, non-limiting example of a mathematical algorithm 
utilized for the comparison of sequences is the algorithm of Myers and Miller, 
CABIOS (1989). Such an algorithm is incorporated into the ALIGN program
(version 2.0), which is part of the GCG (Accelrys) sequence alignment software
package. When utilizing the ALIGN program for comparing amino acid sequences,
a PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4
can be used. Additional algorithms for sequence analysis are known in the art and
include ADVANCE and ADAM as described in Torelli and Robotti, Comput.
Appl. Biosci., 10: 3-5 (1994); and FASTA described in Pearson and Lipman, Proc.
Natl. Acad. Sci. USA, 85: 2444-8 (1988).

In another embodiment, the percent identity between two amino acid 
sequences can be accomplished using the GAP program in the GCG software 
package (available at http://www.accelrys.com, as available on August 31, 2001)
using either a BLOSUM62 matrix or a PAM250 matrix, and a gap weight of 12, 10,
8, 6, or 4 and a length weight of 2, 3, or 4. In yet another embodiment, the percent 
identity between two nucleic acid sequences can be accomplished using the GAP
program in the GCG software package (available at http://www.cgc.com), using a 
gap weight of 50 and a length weight of 3. 

The invention also encompasses HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), and HDRP(ANLS) polypeptides having a lower degree of identity
but having sufficient similarity so as to perform one or more of the same functions
performed by an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide encoded by a nucleic acid molecule of the invention.
Similarity is determined by conservative amino acid substitution. Such substitutions
are those that substitute a given amino acid in a polypeptide by another amino acid
of like characteristics. Conservative substitutions are likely to be phenotypically
silent. Typically seen as conservative substitutions are the replacements, one for
another, among the aliphatic amino acids Ala, Val, Leu, and Ile; interchange of the
hydroxyl residues Ser and Thr; exchange of the acidic residues Asp and Glu;
substitution between the amide residues Asn and Gln; exchange of the basic residues
Lys and Arg; and replacements among the aromatic residues Phe and Tyr. Guidance
concerning which amino acid changes are likely to be phenotypically silent is found
in Bowie et al., Science 247: 1306-1310 (1990).
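
The conservative substitution groups listed above can be encoded as a simple lookup. The following is a minimal sketch only: the group labels come from the paragraph above, while the function name and the use of three-letter residue codes as strings are assumptions made for illustration.

```python
# Residue groups taken from the paragraph above (three-letter codes).
CONSERVATIVE_GROUPS = [
    {"Ala", "Val", "Leu", "Ile"},  # aliphatic
    {"Ser", "Thr"},                # hydroxyl
    {"Asp", "Glu"},                # acidic
    {"Asn", "Gln"},                # amide
    {"Lys", "Arg"},                # basic
    {"Phe", "Tyr"},                # aromatic
]


def is_conservative(original: str, replacement: str) -> bool:
    """True if two different residues fall within the same listed group."""
    if original == replacement:
        return False  # no substitution took place
    return any(
        original in group and replacement in group
        for group in CONSERVATIVE_GROUPS
    )


if __name__ == "__main__":
    print(is_conservative("Leu", "Ile"))  # same aliphatic group
    print(is_conservative("Lys", "Asp"))  # basic versus acidic
```

Residues not named in the paragraph above (for example, Gly or Trp) are treated as having no conservative partner in this sketch, which is a simplification.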

A variant polypeptide can differ in amino acid sequence by one or more
substitutions, deletions, insertions, inversions, fusions, and truncations or a
combination of any of these. Further, variant polypeptides can be fully functional or
can lack function in one or more activities, for example, in histone deacetylase
activity or transcription repression activity. Fully functional variants typically
contain only conservative variation or variation in non-critical residues or in
non-critical regions. Functional variants can also contain substitution of similar
amino acids that result in no change or an insignificant change in function.
Alternatively, such substitutions may positively or negatively affect function to some
degree. Non-functional variants typically contain one or more non-conservative
amino acid substitutions, deletions, insertions, inversions, or truncations or a
substitution, insertion, inversion, or deletion in a critical residue or critical region.
Such critical regions include the HDAC domains, which provide the polypeptide
with deacetylase activity, as shown in the nucleic acid sequences of FIGS. 1A-1G, as
well as in the schematic of FIG. 4.

Amino acids that are essential for function can be identified by methods 
known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis 
(Cunningham et al., Science, 244: 1081-1085 (1989)). The latter procedure
introduces a single alanine mutation at each of the residues in the molecule (one
mutation per molecule). The resulting mutant molecules are then tested for
biological activity in vitro. Sites that are critical for polypeptide activity can also be
determined by structural analysis, such as crystallization, nuclear magnetic
resonance, or photoaffinity labeling (see Smith et al., J. Mol. Biol., 224: 899-904
(1992); and de Vos et al., Science, 255: 306-312 (1992)).
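
The alanine-scanning procedure described above (one alanine mutation per molecule, at each residue in turn) can be sketched as follows. This is an illustrative sketch only: the function name, the use of one-letter amino acid codes, and the decision to skip positions that are already alanine are assumptions, and the toy sequence in the example is hypothetical rather than any sequence of the invention.

```python
def alanine_scan(sequence: str) -> list[tuple[int, str, str]]:
    """Enumerate single-alanine mutants of a protein sequence.

    Returns (1-based position, original residue, mutant sequence) for
    every position that is not already alanine.
    """
    mutants = []
    for index, residue in enumerate(sequence):
        if residue == "A":
            continue  # already alanine, so no informative mutant (assumption)
        mutant = sequence[:index] + "A" + sequence[index + 1:]
        mutants.append((index + 1, residue, mutant))
    return mutants


if __name__ == "__main__":
    # Hypothetical four-residue peptide used only to show the output shape.
    for position, residue, mutant in alanine_scan("MKAL"):
        print(f"{residue}{position}A: {mutant}")
```

Each mutant sequence produced this way corresponds to one molecule that would then be tested for biological activity in vitro.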

The invention also includes HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), and HDRP(ANLS) polypeptide fragments of the polypeptides of 
the invention. Fragments can be derived from a polypeptide comprising SEQ ID
NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10, or from
a polypeptide encoded by a nucleic acid molecule comprising SEQ ID NO: 1, SEQ
ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9 or a portion thereof and
the complements thereof or other variants. The present invention also encompasses
fragments of the variants of the polypeptides described herein. Useful fragments
include those that retain one or more of the biological activities of the polypeptide as
well as fragments that can be used as an immunogen to generate polypeptide-specific 
antibodies. 

Biologically active fragments (peptides that are, for example, 6, 9, 12, 15, 16, 
20, 30, 35, 36, 37, 38, 39, 40, 50, 100, or more amino acids in length) can comprise
a domain, segment, or motif, for example, an HDAC domain, that has been
identified by analysis of the polypeptide sequence using well-known methods, e.g.,
signal peptides, extracellular domains, one or more transmembrane segments or 
loops, ligand binding regions, zinc finger domains, DNA binding domains, acylation 
sites, glycosylation sites, or phosphorylation sites. 

Fragments can be discrete (not fused to other amino acids or polypeptides) or
can be within a larger polypeptide. Further, several fragments can be comprised
within a single larger polypeptide. In one embodiment, a fragment designed for
expression in a host can have heterologous pre- and pro-polypeptide regions fused to
the amino terminus of the polypeptide fragment and an additional region fused to the 
carboxyl terminus of the fragment. 

The invention thus provides chimeric or fusion polypeptides. These
comprise an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide of the invention operatively linked to a heterologous protein or
polypeptide having an amino acid sequence not substantially homologous to the
polypeptide. "Operatively linked" indicates that the polypeptide and the
heterologous protein are fused in-frame. The heterologous protein can be fused to
the N-terminus or C-terminus of the polypeptide. In one embodiment, the fusion
polypeptide does not affect the function of the polypeptide per se. For example, the
fusion polypeptide can be a GST-fusion polypeptide in which the polypeptide
sequences are fused to the C-terminus of the GST sequences. Other types of fusion
polypeptides include, but are not limited to, enzymatic fusion polypeptides, for
example, β-galactosidase fusions, yeast two-hybrid GAL fusions, poly-His fusions,
and Ig fusions. Such fusion polypeptides, particularly poly-His fusions, can
facilitate the purification of recombinant polypeptide. In certain host cells (e.g.,
mammalian host cells), expression and/or secretion of a polypeptide can be
increased by using a heterologous signal sequence. Therefore, in another
embodiment, the fusion polypeptide contains a heterologous signal sequence at its
N-terminus.
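
The requirement that the two partners be fused in-frame can be illustrated at the level of coding sequences. The sketch below is hypothetical: the function name and the toy sequences are assumptions, and it only checks that each coding sequence is a whole number of codons and removes a terminal stop codon from the N-terminal partner; it does not check for internal stop codons or model any particular vector.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_in_frame(n_terminal_cds: str, c_terminal_cds: str) -> str:
    """Join two coding sequences so the second stays in the first's frame.

    Each input must be a nonzero multiple of three nucleotides. A stop
    codon at the end of the N-terminal partner is removed so translation
    continues into the C-terminal partner.
    """
    for name, cds in (("N-terminal", n_terminal_cds),
                      ("C-terminal", c_terminal_cds)):
        if len(cds) == 0 or len(cds) % 3 != 0:
            raise ValueError(
                f"{name} coding sequence must be a nonzero multiple of 3 nt"
            )
    if n_terminal_cds[-3:].upper() in STOP_CODONS:
        n_terminal_cds = n_terminal_cds[:-3]
    return n_terminal_cds + c_terminal_cds


if __name__ == "__main__":
    # Toy sequences standing in for a tag (e.g., GST) and a polypeptide of interest.
    print(fuse_in_frame("ATGGCCTAA", "ATGAAATGA"))
```

In a GST fusion of the kind described above, the GST coding sequence would play the role of the N-terminal partner.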

EP-A 0464 533 discloses fusion proteins comprising various portions of 
immunoglobulin constant regions. The Fc is useful in therapy and diagnosis and
thus results, for example, in improved pharmacokinetic properties (EP-A 0232 262).
In drug discovery, for example, human proteins have been fused with Fc portions for
the purpose of high-throughput screening assays to identify antagonists (see
Bennett et al., Journal of Molecular Recognition, 8: 52-58 (1995) and Johanson et
al., The Journal of Biological Chemistry, 270, 16: 9459-9471 (1995)). Thus, this
invention also encompasses soluble fusion polypeptides containing a polypeptide of
the invention and various portions of the constant regions of heavy or light chains of
immunoglobulins of various subclasses (IgG, IgM, IgA, IgE).

A chimeric or fusion polypeptide can be produced by standard recombinant
DNA techniques. For example, DNA fragments coding for the different polypeptide
sequences are ligated together in-frame in accordance with conventional techniques.
In another embodiment, the fusion gene can be synthesized by conventional
techniques including automated DNA synthesizers. Alternatively, PCR
amplification of nucleic acid fragments can be carried out using anchor primers that
give rise to complementary overhangs between two consecutive nucleic acid
fragments that can subsequently be annealed and re-amplified to generate a chimeric
nucleic acid sequence (see Ausubel et al., "Current Protocols in Molecular Biology,"
John Wiley & Sons, (1998), the entire teachings of which are incorporated by
reference herein). Moreover, many expression vectors are commercially available
that already encode a fusion moiety (e.g., a GST protein). A nucleic acid molecule
encoding a polypeptide of the invention can be cloned into such an expression vector
such that the fusion moiety is linked in-frame to the polypeptide.

The substantially pure, isolated, or substantially pure and isolated HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide can be
purified from cells that naturally express it, purified from cells that have been altered
to express it (recombinant), or synthesized using known protein synthesis methods.
In one embodiment, the polypeptide is produced by recombinant DNA techniques.
For example, a nucleic acid molecule encoding the polypeptide is cloned into an
expression vector, the expression vector introduced into a host cell, and the
polypeptide expressed in the host cell. The polypeptide can then be isolated from
the cells by an appropriate purification scheme using standard protein purification
techniques.

In general, HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), and
HDRP(ANLS) polypeptides of the present invention can be used as a molecular
weight marker on SDS-PAGE gels or on molecular sieve gel filtration columns
using art-recognized methods. The polypeptides of the present invention can be
used to raise antibodies or to elicit an immune response. The polypeptides can also
be used as a reagent, e.g., a labeled reagent, in assays to quantitatively determine
levels of the polypeptide or a molecule to which it binds (e.g., a receptor or a ligand)
in biological fluids. The polypeptides can also be used as markers for cells or tissues
in which the corresponding polypeptide is preferentially expressed, either
constitutively, during tissue differentiation, or in a diseased state. The polypeptides
can be used to isolate a corresponding binding agent, and to screen for peptide or
small molecule antagonists or agonists of the binding interaction. The polypeptides
of the present invention can also be used as therapeutic agents.

NUCLEIC ACID MOLECULES OF THE INVENTION 

The present invention also features isolated HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), and HDRP(ANLS) nucleic acid molecules.

By a "histone deacetylase nucleic acid molecule" is meant a nucleic acid
molecule that encodes a histone deacetylase polypeptide. Such histone nucleic acids
include, for example, the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) nucleic acid molecule described in detail herein; an isolated nucleic
acid comprising SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or
SEQ ID NO: 9; a complement of an isolated nucleic acid comprising SEQ ID NO: 1,
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9; an isolated
nucleic acid encoding a histone deacetylase polypeptide of SEQ ID NO: 2, SEQ ID
NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10; a complement of an
isolated nucleic acid encoding a histone deacetylase polypeptide of SEQ ID NO: 2,
SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10; a nucleic acid
that is hybridizeable under high stringency conditions to a nucleic acid molecule that
encodes any of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, or SEQ ID NO: 8, or
a complement thereof; a nucleic acid molecule that is hybridizeable under high
stringency conditions to a nucleic acid comprising SEQ ID NO: 1, SEQ ID NO: 3,
SEQ ID NO: 5, or SEQ ID NO: 7; and an isolated nucleic acid molecule that has at
least 55%, more preferably, 60%, 65%, 70%, 75%, 80%, 85%, or 90%, and most
preferably, 95% or 99% sequence identity with any one of SEQ ID NO: 1, SEQ ID
NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, or a complement thereof.

An HDAC9 nucleic acid molecule is a nucleic acid molecule that encodes an
HDAC9 polypeptide. In one embodiment, the HDAC9 nucleic acid molecule is
selected from: a nucleic acid molecule that comprises the nucleic acid sequence of
SEQ ID NO: 1; a complement of an isolated nucleic acid comprising SEQ ID NO: 1;
an isolated nucleic acid encoding a histone deacetylase polypeptide of SEQ ID NO:
2; a complement of an isolated nucleic acid encoding a histone deacetylase
polypeptide of SEQ ID NO: 2; a nucleic acid that is hybridizeable under high
stringency conditions to a nucleic acid molecule that encodes SEQ ID NO: 2; a
nucleic acid molecule that is hybridizeable under high stringency conditions to a
nucleic acid comprising SEQ ID NO: 1; and an isolated nucleic acid molecule that
has preferably, at least 55%, more preferably, 60%, 65%, 70%, 75%, 80%, 85%, or
90%, and most preferably, 95% or 99% sequence identity with SEQ ID NO: 1, as
determined using the BLAST program and parameters described herein. In another
embodiment, the HDAC9 nucleic acid molecule consists of the nucleic acid
sequence of SEQ ID NO: 1. 

An HDAC9a nucleic acid molecule is a nucleic acid molecule that encodes 
an HDAC9a polypeptide. An HDAC9a nucleic acid molecule preferably has at least 
55% sequence identity to SEQ ID NO: 3. In one embodiment, the HDAC9a nucleic
acid molecule is selected from: a nucleic acid molecule that comprises the nucleic
acid sequence of SEQ ID NO: 3; a complement of an isolated nucleic acid
comprising SEQ ID NO: 3; an isolated nucleic acid encoding a histone deacetylase
polypeptide of SEQ ID NO: 4; a complement of an isolated nucleic acid encoding a
histone deacetylase polypeptide of SEQ ID NO: 4; a nucleic acid that is
hybridizeable under high stringency conditions to a nucleic acid molecule that
encodes SEQ ID NO: 4; a nucleic acid molecule that is hybridizeable under high
stringency conditions to a nucleic acid comprising SEQ ID NO: 3; and an isolated
nucleic acid molecule that has preferably, at least 55%, more preferably, 60%, 65%,
70%, 75%, 80%, 85%, or 90%, and most preferably, 95% or 99% sequence identity
with SEQ ID NO: 3 or a complement thereof, as determined using the BLAST
program and parameters described herein. In another embodiment, the HDAC9a 
nucleic acid molecule consists of the nucleic acid sequence of SEQ ID NO: 3. 

An HDAC9(ANLS) nucleic acid molecule is a nucleic acid molecule that 
encodes an HDAC9(ANLS) polypeptide. In one embodiment, the HDAC9(ANLS) 
nucleic acid molecule is selected from: a nucleic acid molecule that comprises the
nucleic acid sequence of SEQ ID NO: 5; a complement of an isolated nucleic acid
comprising SEQ ID NO: 5; an isolated nucleic acid encoding a histone deacetylase
polypeptide of SEQ ID NO: 6; a complement of an isolated nucleic acid encoding a
histone deacetylase polypeptide of SEQ ID NO: 6; a nucleic acid that is
hybridizeable under high stringency conditions to a nucleic acid molecule that
encodes SEQ ID NO: 6; a nucleic acid molecule that is hybridizeable under high
stringency conditions to a nucleic acid comprising SEQ ID NO: 5; and an isolated
nucleic acid molecule that has preferably, at least 55%, more preferably, 60%, 65%,
70%, 75%, 80%, 85%, or 90%, and most preferably, 95% or 99% sequence identity
with SEQ ID NO: 5 or a complement thereof, as determined using the BLAST
program and parameters described herein. In another embodiment, the
HDAC9(ANLS) nucleic acid molecule consists of the nucleic acid sequence of SEQ
ID NO: 5. 

An HDAC9a(ANLS) nucleic acid molecule is a nucleic acid molecule that 
encodes an HDAC9a(ANLS) polypeptide. In one embodiment, the HDAC9a(ANLS)
nucleic acid molecule is selected from: a nucleic acid molecule that comprises the
nucleic acid sequence of SEQ ID NO: 7; a complement of an isolated nucleic acid
comprising SEQ ID NO: 7; an isolated nucleic acid encoding a histone deacetylase
polypeptide of SEQ ID NO: 8; a complement of an isolated nucleic acid encoding a
histone deacetylase polypeptide of SEQ ID NO: 8; a nucleic acid that is
hybridizeable under high stringency conditions to a nucleic acid molecule that
encodes SEQ ID NO: 8; a nucleic acid molecule that is hybridizeable under high
stringency conditions to a nucleic acid comprising SEQ ID NO: 7; and an isolated
nucleic acid molecule that has preferably, at least 55%, more preferably, 60%, 65%,
70%, 75%, 80%, 85%, or 90%, and most preferably, 95% or 99% sequence identity
with SEQ ID NO: 7 or a complement thereof, as determined using the BLAST
program and parameters described herein. In another embodiment, the
HDAC9a(ANLS) nucleic acid molecule consists of the nucleic acid sequence of SEQ
ID NO: 7. 

An "HDRP(ANLS) nucleic acid molecule" is a nucleic acid molecule that 
encodes an HDRP(ANLS) polypeptide. In one embodiment, the HDRP(ANLS) 
nucleic acid molecule is selected from: a nucleic acid molecule that comprises the
nucleic acid sequence of SEQ ID NO: 9; a complement of an isolated nucleic acid
comprising SEQ ID NO: 9; an isolated nucleic acid encoding a histone deacetylase
polypeptide of SEQ ID NO: 10; a complement of an isolated nucleic acid encoding a
histone deacetylase polypeptide of SEQ ID NO: 10; and an isolated nucleic acid
molecule that has preferably, at least 55%, more preferably, 60%, 65%, 70%, 75%,
80%, 85%, or 90%, and most preferably, 95% or 99% sequence identity with SEQ
ID NO: 9 or a complement thereof, as determined using the BLAST program and
parameters described herein. In another embodiment, the HDRP(ANLS) nucleic
acid molecule consists of the nucleic acid sequence of SEQ ID NO: 9. 

The isolated nucleic acid molecules of the present invention can be RNA, for 
example, mRNA, or DNA, such as cDNA and genomic DNA. DNA molecules can 
be double-stranded or single-stranded; single-stranded RNA or DNA can be either
the coding, or sense, strand or the non-coding, or antisense, strand. The nucleic acid
molecule can include all or a portion of the coding sequence of the gene and can
further comprise additional non-coding sequences such as introns and non-coding 3'
and 5' sequences (including regulatory sequences, for example). Additionally, the
nucleic acid molecule can be fused to a marker sequence, for example, a sequence
that encodes a polypeptide to assist in isolation or purification of the polypeptide.
Such sequences include, but are not limited to, those that encode a
glutathione-S-transferase (GST) fusion protein and those that encode a
hemagglutinin A (HA) polypeptide marker from influenza.

An "isolated," "substantially pure," or "substantially pure and isolated"
nucleic acid molecule, as used herein, is one that is separated from nucleic acids that
normally flank the gene or nucleotide sequence (as in genomic sequences) and/or has
been completely or partially purified from other transcribed sequences (e.g., as in an
RNA or cDNA library). For example, an isolated nucleic acid of the invention may
be substantially isolated with respect to the complex cellular milieu in which it
naturally occurs, or culture medium when produced by recombinant techniques, or
chemical precursors or other chemicals when chemically synthesized. In some
instances, the isolated material will form part of a composition (for example, a crude
extract containing other substances), buffer system, or reagent mix. In other
circumstances, the material may be purified to essential homogeneity, for example,
as determined by agarose gel electrophoresis or column chromatography such as
HPLC. Preferably, an isolated nucleic acid molecule comprises at least about 50, 80,
or 90% (on a molar basis) of all macromolecular species present.

With regard to genomic DNA, the term "isolated" also can refer to nucleic 
acid molecules that are separated from the chromosome with which the genomic 
DNA is naturally associated. For example, the isolated nucleic acid molecule can
contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb, or 0.1 kb of nucleotides 
that flank the nucleic acid molecule in the genomic DNA of the cell from which the 
nucleic acid molecule is derived. 

The HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
nucleic acid molecule can be fused to other coding or regulatory sequences and still
be considered isolated. Thus, recombinant DNA contained in a vector is included in
the definition of "isolated" as used herein. Also, isolated nucleic acid molecules
include recombinant DNA molecules in heterologous host cells, as well as partially
or substantially purified DNA molecules in solution. "Isolated" nucleic acid
molecules also encompass in vivo and in vitro RNA transcripts of the DNA
molecules of the present invention. An isolated nucleic acid molecule or nucleotide
sequence can include a nucleic acid molecule or nucleotide sequence that is
synthesized chemically or by recombinant means. Therefore, recombinant DNA
contained in a vector is included in the definition of "isolated" as used herein.
Isolated nucleotide molecules also include recombinant DNA molecules in
heterologous organisms, as well as partially or substantially purified DNA molecules
in solution. In vivo and in vitro RNA transcripts of the DNA molecules of the
present invention are also encompassed by "isolated" nucleotide sequences. Such
isolated nucleotide sequences are useful in the manufacture of the encoded
polypeptide, as probes for isolating homologous sequences (e.g., from other
mammalian species), for gene mapping (e.g., by in situ hybridization with
chromosomes), or for detecting expression of the gene in tissue (e.g., human tissue),
such as by Northern blot analysis.

The present invention also pertains to variant HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), and HDRP(ANLS) nucleic acid molecules that are
not necessarily found in nature but that encode an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide. Thus, for
example, DNA molecules that comprise a sequence that is different from the
naturally-occurring HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) nucleotide sequence but which, due to the degeneracy of the genetic
code, encode an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide of the present invention are also the subject of this
invention.
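
The degeneracy of the genetic code referred to above can be shown with a short sketch in which two different DNA sequences translate to the same peptide. The sketch assumes the standard genetic code; the function name and the toy sequences are hypothetical and are not sequences of the invention.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence (one-letter amino acids, '*' = stop)."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    # Two toy sequences that differ at the nucleotide level.
    first, second = "ATGAAACTG", "ATGAAGTTA"
    print(translate(first), translate(second), first != second)
```

Here the codons AAA and AAG both encode lysine, and CTG and TTA both encode leucine, so the two nucleotide sequences differ while the encoded peptide is unchanged.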

The invention also encompasses HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), and HDRP(ANLS) nucleotide sequences encoding portions
(fragments), or encoding variant polypeptides such as analogues or derivatives of an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide. Such variants can be naturally-occurring, such as in the case of allelic
variation or single nucleotide polymorphisms, or non-naturally-occurring, such as
those induced by various mutagens and mutagenic processes. Intended variations
include, but are not limited to, addition, deletion, and substitution of one or more
nucleotides that can result in conservative or non-conservative amino acid changes,
including additions and deletions. Preferably, the HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleotide (and/or resultant
amino acid) changes are silent or conserved; that is, they do not alter the
characteristics or activity of the HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide. In one preferred embodiment, the
nucleotide sequences are fragments that comprise one or more polymorphic
microsatellite markers.

Other alterations of the HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) nucleic acid molecules of the invention can
include, for example, labeling, methylation, internucleotide modifications such as
uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoamidates,
and carbamates), charged linkages (e.g., phosphorothioates or phosphorodithioates),
pendent moieties (e.g., polypeptides), intercalators (e.g., acridine or psoralen),
chelators, alkylators, and modified linkages (e.g., alpha anomeric nucleic acids).
Also included are synthetic molecules that mimic nucleic acid molecules in the
ability to bind to a designated sequence via hydrogen bonding and other chemical
interactions. Such molecules include, for example, those in which peptide linkages
substitute for phosphate linkages in the backbone of the molecule.

The invention also pertains to HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), and HDRP(ANLS) nucleic acid molecules that hybridize under
high stringency hybridization conditions, such as for selective hybridization, to a
nucleotide sequence described herein (e.g., nucleic acid molecules that specifically
hybridize to a nucleotide sequence encoding polypeptides described herein, and,
optionally, have an activity of the polypeptide). In one embodiment, the invention
includes variants described herein that hybridize under high stringency hybridization
conditions (e.g., for selective hybridization) to a nucleotide sequence comprising a
nucleotide sequence selected from SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5,
SEQ ID NO: 7, SEQ ID NO: 9 and the complement of SEQ ID NO: 1, SEQ ID NO:
3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9. In another embodiment, the
invention includes variants described herein that hybridize under high stringency
hybridization conditions (e.g., for selective hybridization) to a nucleotide sequence
encoding an amino acid sequence of SEQ ID NO: 2 (HDAC9), SEQ ID NO: 4
(HDAC9a), SEQ ID NO: 6 (HDAC9(ANLS)), SEQ ID NO: 8 (HDAC9a(ANLS)), or
SEQ ID NO: 10 (HDRP(ANLS)). In a preferred embodiment, the variant that
hybridizes under high stringency hybridization conditions encodes a polypeptide
that has a biological activity of an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide (e.g., histone deacetylase activity
or transcription repression activity).

Such nucleic acid molecules can be detected and/or isolated by specific
hybridization (e.g., under high stringency conditions). "Specific hybridization," as
used herein, refers to the ability of a first nucleic acid to hybridize to a second
nucleic acid in a manner such that the first nucleic acid does not hybridize to any
nucleic acid other than to the second nucleic acid (e.g., when the first nucleic acid
has a higher similarity to the second nucleic acid than to any other nucleic acid in a
sample wherein the hybridization is to be performed). "Stringency conditions" for
hybridization is a term of art that refers to the incubation and wash conditions, e.g.,
conditions of temperature and buffer concentration, that permit hybridization of a
particular nucleic acid to a second nucleic acid; the first nucleic acid may be
perfectly (i.e., 100%) complementary to the second, or the first and second may
share some degree of complementarity that is less than perfect (e.g., 70%, 75%,
85%, 95%). For example, certain high stringency conditions can be used that
distinguish perfectly complementary nucleic acids from those of less
complementarity. "High stringency conditions," "moderate stringency conditions,"
and "low stringency conditions" for nucleic acid hybridizations are explained on
pages 2.10.1-2.10.16 and pages 6.3.1-6.3.6 in Current Protocols in Molecular
Biology (See Ausubel et al., supra, the entire teachings of which are incorporated by
reference herein). The exact conditions that determine the stringency of
hybridization depend not only on ionic strength (e.g., 0.2XSSC or 0.1XSSC),
temperature (e.g., room temperature, 42°C or 68°C), and the concentration of
destabilizing agents such as formamide or denaturing agents such as SDS, but also
on factors such as the length of the nucleic acid sequence, base composition, percent
mismatch between hybridizing sequences, and the frequency of occurrence of
subsets of that sequence within other non-identical sequences. Thus, equivalent
conditions can be determined by varying one or more of these parameters while
maintaining a similar degree of identity or similarity between the two nucleic acid
molecules. Typically, conditions are used such that sequences at least about 60%, at
least about 70%, at least about 80%, at least about 90% or at least about 95% or
more identical to each other remain hybridized to one another. By varying
hybridization conditions from a level of stringency at which no hybridization occurs
to a level at which hybridization is first observed, conditions that will allow a given
sequence to hybridize (e.g., selectively) with the most similar sequences in the
sample can be determined.

Exemplary conditions are described in Krause and Aaronson, Methods in
Enzymology, 200:546-556 (1991). See also Ausubel, et al., supra, which describes
the determination of washing conditions for moderate or low stringency conditions.
Washing is the step in which conditions are usually set so as to determine a
minimum level of complementarity of the hybrids. Generally, starting from the
lowest temperature at which only homologous hybridization occurs, each °C by
which the final wash temperature is reduced (holding SSC concentration constant)
allows an increase by 1% in the maximum extent of mismatching among the
sequences that hybridize. Generally, doubling the concentration of SSC results in an
increase in Tm of 17°C. Using these guidelines, the washing temperature can be
determined empirically for high, moderate, or low stringency, depending on the level
of mismatch sought.
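
The two washing guidelines just stated can be written out as a small sketch.
The function names are hypothetical, and extending the "doubling" guideline to
arbitrary concentration ratios through a base-2 logarithm is an assumption
made here for illustration; the specification states the guideline only for a
doubling. The result is a starting point for empirical determination, not a
prediction.

```python
import math


def max_mismatch_percent(reference_temp_c, wash_temp_c):
    """Approximate extra mismatch (%) tolerated by a cooler final wash.

    reference_temp_c: lowest temperature at which only homologous
    hybridization occurs (SSC concentration held constant).
    Guideline from the text: about 1% more mismatch per degree C of reduction.
    """
    return max(0.0, reference_temp_c - wash_temp_c)


def tm_shift_for_ssc_change(ssc_from, ssc_to, shift_per_doubling=17.0):
    """Approximate change in Tm (degrees C) when the SSC concentration changes.

    Guideline from the text: doubling SSC raises Tm by about 17 degrees C.
    Assumption: other ratios scale with log2 of the concentration ratio.
    """
    return shift_per_doubling * math.log2(ssc_to / ssc_from)


# Washing 10 degrees C below the reference temperature tolerates roughly
# 10% mismatch under the first guideline.
print(max_mismatch_percent(68.0, 58.0))
# Doubling the SSC concentration under the second guideline.
print(tm_shift_for_ssc_change(1.0, 2.0))
```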

For example, a low stringency wash can comprise washing in a solution
containing 0.2XSSC/0.1% SDS for 10 minutes at room temperature; a moderate
stringency wash can comprise washing in a prewarmed (42°C) solution
containing 0.2XSSC/0.1% SDS for 15 minutes at 42°C; and a high stringency wash
can comprise washing in a prewarmed (68°C) solution containing 0.1XSSC/0.1% SDS
for 15 minutes at 68°C. Furthermore, washes can be performed repeatedly or
sequentially to obtain a desired result as known in the art. Equivalent conditions can
be determined by varying one or more of the parameters given as an example, as
known in the art, while maintaining a similar degree of identity or similarity between
the target nucleic acid molecule and the primer or probe used.

To determine the percent homology or identity of two nucleic acid
sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps
can be introduced in the sequence of one polypeptide or nucleic acid molecule for
optimal alignment with the other polypeptide or nucleic acid molecule). The amino
acid residues or nucleotides at corresponding amino acid positions or nucleotide
positions are then compared, as described above.
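
A minimal sketch of the position-by-position comparison follows. It assumes
the two sequences have already been aligned (with "-" marking an introduced
gap) and takes percent identity as identical positions divided by total
aligned positions, times 100; that choice of denominator, like the function
name, is an assumption for illustration.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two pre-aligned sequences of equal length.

    A position counts as identical only if both sequences carry the same
    residue there and that residue is not a gap character.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    matches = sum(
        1 for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != gap
    )
    return 100.0 * matches / len(aligned_a)


# Hypothetical aligned sequences: 7 of 8 positions identical.
print(percent_identity("ACGTACGT", "ACGTTCGT"))
```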

The present invention also provides isolated HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), and HDRP(ANLS) nucleic acid molecules that
contain a fragment or portion that hybridizes under highly stringent conditions to a
nucleotide sequence comprising a nucleotide sequence selected from SEQ ID NO: 1,
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, and the complement
of any of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID
NO: 9 and also provides isolated nucleic acid molecules that contain a fragment or
portion that hybridizes under highly stringent conditions to a nucleotide sequence
encoding an amino acid sequence selected from SEQ ID NO: 2, SEQ ID NO: 4, SEQ
ID NO: 6, SEQ ID NO: 8, and SEQ ID NO: 10. The nucleic acid fragments of the
invention are at least about 15, preferably, at least about 18, 20, 23, or 25
nucleotides, and can be 30, 40, 50, 100, 200 or more nucleotides in length. Longer
fragments, for example, 30 or more nucleotides in length, that encode antigenic
polypeptides described herein are particularly useful, such as for the generation of
antibodies as described above.

In a related aspect, the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS),
and HDRP(ANLS) nucleic acid fragments of the invention are used as probes or
primers in assays such as those described herein. "Probes" or "primers" are
oligonucleotides that hybridize in a base-specific manner to a complementary strand
of nucleic acid molecules. Such probes and primers include polypeptide nucleic
acids, as described in Nielsen et al., Science, 254, 1497-1500 (1991). As also used
herein, the term "primer" in particular refers to a single-stranded oligonucleotide that
acts as a point of initiation of template-directed DNA synthesis using well-known
methods (e.g., PCR, LCR) including, but not limited to those described herein.

Typically, a probe or primer comprises a region of nucleotide sequence that
hybridizes to at least about 15, typically about 20-25, and more typically about 40,
50 or 75, consecutive nucleotides of a nucleic acid molecule comprising a
contiguous nucleotide sequence selected from: SEQ ID NO: 1, SEQ ID NO: 3, SEQ
ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, the complement of any of SEQ ID NO: 1,
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, and a sequence
encoding an amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6,
SEQ ID NO: 8, or SEQ ID NO: 10.

In preferred embodiments, a probe or primer comprises 100 or fewer
nucleotides, preferably, from 6 to 50 nucleotides, and more preferably, from 12 to 30
nucleotides. In other embodiments, the probe or primer is at least 70% identical to
the contiguous nucleotide sequence or to the complement of the contiguous
nucleotide sequence, preferably, at least 80% identical, more preferably, at least 90%
identical, even more preferably, at least 95% identical, or even capable of selectively
hybridizing to the contiguous nucleotide sequence or to the complement of the
contiguous nucleotide sequence. Often, the probe or primer further comprises a
label, e.g., radioisotope, fluorescent compound, enzyme, or enzyme co-factor.

The nucleic acid molecules of the invention such as those described above
can be identified and isolated using standard molecular biology techniques and the
sequence information provided in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5,
SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6,
SEQ ID NO: 8, and/or SEQ ID NO: 10. For example, nucleic acid molecules can
be amplified and isolated by the polymerase chain reaction using synthetic
oligonucleotide primers designed based on one or more of the nucleic acid
sequences provided above and/or the complement of those sequences. Or such
nucleic acid molecules may be designed based on nucleotide sequences encoding
one or more of the amino acid sequences provided in SEQ ID NO: 2, SEQ ID NO: 4,
SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10. See generally PCR Technology:
Principles and Applications for DNA Amplification (ed. H. A. Erlich, Freeman Press,
NY, NY (1992)); PCR Protocols: A Guide to Methods and Applications (Eds. Innis
et al., Academic Press, San Diego, CA (1990)); Mattila et al., Nucleic Acids Res.,
19: 4967 (1991); Eckert et al., PCR Methods and Applications, 1: 17 (1991); PCR
(eds. McPherson et al., IRL Press, Oxford); and U.S. Patent No. 4,683,202. The
nucleic acid molecules can be amplified using cDNA, mRNA, or genomic DNA as a
template, cloned into an appropriate vector and characterized by DNA sequence
analysis.
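
As a rough illustration of how a primer pair delimits the amplified region, the
following sketch locates a forward primer on a template strand and the binding
site of a reverse primer downstream of it. It assumes exact matching only (no
mismatches, no thermodynamics), and the template, primers and function names
are hypothetical placeholders rather than sequences from the specification.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def amplicon(template, forward_primer, reverse_primer):
    """Return the region bounded by the two primers, or None if either fails to match.

    The forward primer must match the given (sense) strand exactly; the
    reverse primer must match the opposite strand, so its reverse complement
    is searched for on the given strand, downstream of the forward primer.
    """
    template = template.upper()
    start = template.find(forward_primer.upper())
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse_primer)
    end = template.find(reverse_site, start + len(forward_primer))
    if end == -1:
        return None
    return template[start:end + len(reverse_site)]


# Hypothetical template and primers (illustration only).
template = "GGGATGCCCAAATTTGGGCCCTAACCC"
print(amplicon(template, "ATGCCC", "TTAGGG"))
```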

Other suitable amplification methods include the ligase chain reaction (LCR)
(See Wu and Wallace, Genomics, 4:560 (1989), Landegren et al., Science, 241:1077
(1988)), transcription amplification (Kwoh et al., Proc. Natl. Acad. Sci. USA,
86:1173 (1989)), and self-sustained sequence replication (See Guatelli et al., Proc.
Natl. Acad. Sci. USA, 87:1874 (1990)) and nucleic acid based sequence
amplification (NASBA). The latter two amplification methods involve isothermal
reactions based on isothermal transcription, that produce both single stranded RNA
(ssRNA) and double stranded DNA (dsDNA) as the amplification products in a ratio
of about 30 or 100 to 1, respectively.

The amplified DNA can be radiolabeled and used as a probe for screening a
cDNA library derived from human cells, mRNA in zap express, ZIPLOX, or other
suitable vector. Corresponding clones can be isolated, DNA can be obtained
following in vivo excision, and the cloned insert can be sequenced in either or both
orientations by art-recognized methods to identify the correct reading frame
encoding a polypeptide of the appropriate molecular weight. For example, the direct
analysis of the nucleotide sequence of nucleic acid molecules of the present
invention can be accomplished using well-known methods that are commercially
available. See, for example, Sambrook et al., Molecular Cloning, A Laboratory
Manual (2nd Ed., CSHP, New York (1989)); Zyskind et al., Recombinant DNA
Laboratory Manual, (Acad. Press, (1988)). Using these or similar methods, the
polypeptide and the DNA encoding the polypeptide can be isolated, sequenced, and
further characterized.
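
Identifying the reading frame of a sequenced insert in either orientation can
be sketched as a scan of all six frames for the longest ATG-to-stop open
reading frame. This is one simple heuristic chosen for illustration, not the
method of the specification; the function names and the example insert are
hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
STOP_CODONS = {"TAA", "TAG", "TGA"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def longest_orf(dna):
    """Return (strand, frame, orf) for the longest ATG-to-stop ORF.

    strand is "+" for the sequence as given and "-" for its reverse
    complement; frame is the 0-based offset (0, 1 or 2) on that strand.
    An empty orf string means no complete ORF was found.
    """
    best = ("+", 0, "")
    for strand, seq in (("+", dna.upper()), ("-", reverse_complement(dna))):
        for frame in range(3):
            start = None
            for i in range(frame, len(seq) - 2, 3):
                codon = seq[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # first in-frame start codon opens an ORF
                elif start is not None and codon in STOP_CODONS:
                    orf = seq[start:i + 3]
                    if len(orf) > len(best[2]):
                        best = (strand, frame, orf)
                    start = None  # look for a further ORF in this frame
    return best


# Hypothetical insert (illustration only), read in both orientations.
insert = "CCATGAAATTTGGGTAACC"
print(longest_orf(insert))
print(longest_orf(reverse_complement(insert)))
```

The length of the reported open reading frame (in codons) gives a first
estimate of whether the encoded polypeptide is of the expected size.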

Antisense nucleic acid molecules of the invention can be designed using the
nucleotide sequences of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID
NO: 7, SEQ ID NO: 9 and/or the complement of any of SEQ ID NO: 1, SEQ ID NO:
3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9 and/or a portion of those
sequences, and/or the complement of those portions or sequences, and/or a sequence
encoding the amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6,
SEQ ID NO: 8, SEQ ID NO: 10, or encoding a portion of SEQ ID NO: 2, SEQ ID
NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10. Such antisense nucleic
acid molecules can be constructed using chemical synthesis and enzymatic ligation
reactions using procedures known in the art. For example, an antisense nucleic acid
molecule (e.g., an antisense oligonucleotide) can be chemically synthesized using
naturally occurring nucleotides or variously modified nucleotides designed to
increase the biological stability of the molecules or to increase the physical stability
of the duplex formed between the antisense and sense nucleic acids, e.g.,
phosphorothioate derivatives and acridine substituted nucleotides can be used.
Alternatively, the antisense nucleic acid molecule can be produced biologically using
an expression vector into which a nucleic acid molecule has been subcloned in an
antisense orientation (i.e., RNA transcribed from the inserted nucleic acid molecule
will be of an antisense orientation to a target nucleic acid of interest).
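
The relationship between a sense sequence and an antisense oligonucleotide
directed against a portion of it can be sketched as taking the reverse
complement of the chosen target window. The function name, the window
parameters and the example sequence are assumptions for illustration; the
sketch says nothing about which target window or chemical modification would
be effective in practice.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_oligo(sense, start, length):
    """Return a DNA antisense oligonucleotide (written 5' to 3').

    The oligonucleotide is the reverse complement of the window
    sense[start:start + length], so it base-pairs with that window.
    """
    target = sense.upper()[start:start + length]
    if len(target) < length:
        raise ValueError("target window runs past the end of the sequence")
    return target.translate(COMPLEMENT)[::-1]


# Hypothetical sense-strand fragment (illustration only).
print(antisense_oligo("ATGGCCAAGTTC", 0, 6))
```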

In general, the isolated HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS),
and HDRP(ANLS) nucleic acid sequences of the invention can be used as molecular
weight markers on Southern blots, and as chromosome markers that are labeled to
map related gene positions. The nucleic acid sequences can also be used to compare
with endogenous DNA sequences in patients to identify genetic disorders (e.g., a
predisposition for or susceptibility to a cell proliferation disease, an apoptotic
disease, or a cell differentiation disease), and as probes, such as to hybridize and
discover related DNA sequences or to subtract out known sequences from a sample.
The nucleic acid molecules of the present invention can also be used as therapeutic
agents.

By a "cell proliferation disease" is meant a disease that is caused by or results
in undesirably high levels of cell division, undesirably low levels of apoptosis, or
both. For example, cancers such as lymphoma, leukemia, melanoma, ovarian
cancer, breast cancer, pancreatic cancer, prostate cancer, colon cancer, and lung
cancer are all examples of cell proliferation diseases. Myeloproliferative disorders,
including polycythemia vera, essential thrombocythemia, agnogenic myeloid
metaplasia, and chronic myelogenous leukemia are also cell proliferation diseases.

By a "cell differentiation disease" is meant a disease that is caused by or
results in undesirably low levels of cell differentiation, or by undesirably high levels
of cell differentiation. For example, cancers such as lymphoma, leukemia,
melanoma, ovarian cancer, breast cancer, pancreatic cancer, prostate cancer, colon
cancer, and lung cancer are all examples of cell differentiation diseases.
Myeloproliferative disorders, including polycythemia vera, essential
thrombocythemia, agnogenic myeloid metaplasia, and chronic myelogenous
leukemia are also cell differentiation diseases.

By an "apoptotic disease" is meant a condition in which the apoptotic
response is abnormal. This may pertain to a cell or a population of cells that does
not undergo cell death under appropriate conditions. For example, normally a cell
will die upon exposure to apoptotic-triggering agents, such as chemotherapeutic
agents, or ionizing radiation. When, however, a subject has an apoptotic disease, for
example, cancer, the cell or a population of cells may not undergo cell death in
response to contact with apoptotic-triggering agents. In addition, a subject may have
an apoptotic disease when the occurrence of cell death is too low, for example, when
the number of proliferating cells exceeds the number of cells undergoing cell death,
as occurs in cancer when such cells do not properly differentiate.

An apoptotic disease may also be a condition characterized by the occurrence
of undesirably high levels of apoptosis. For example, certain neurodegenerative
diseases, including but not limited to Alzheimer's disease, Parkinson's disease,
amyotrophic lateral sclerosis, multiple sclerosis, restenosis, stroke, and ischemic
brain injury are apoptotic diseases in which neuronal cells undergo undesired cell
death.

Other diseases that the polypeptides and nucleic acid molecules of the
present invention may be useful for diagnosing and/or treating include, but are not
limited to, Huntington's disease.

The HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), and
HDRP(ANLS) nucleic acid molecules of the present invention can further be used to
derive primers for genetic fingerprinting, to raise anti-polypeptide antibodies using
DNA immunization techniques, and as an antigen to raise anti-DNA antibodies or
elicit immune responses. Portions or fragments of the nucleotide sequences
identified herein (and the corresponding complete gene sequences) can be used in
numerous ways as polynucleotide reagents. For example, these sequences can be
used to: (i) map their respective genes on a chromosome; and, thus, locate gene
regions associated with genetic disease; (ii) identify an individual from a minute
biological sample (tissue typing); and (iii) aid in forensic identification of a
biological sample.

In addition, the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), and
HDRP(ANLS) nucleotide sequences of the invention can be used to identify and
express recombinant polypeptides for analysis, characterization, or therapeutic use,
or as markers for tissues in which the corresponding polypeptide is expressed, either
constitutively, during tissue differentiation, or in diseased states. The nucleic acid
sequences can additionally be used as reagents in the screening and/or diagnostic
assays described herein, and can also be included as components of kits (e.g.,
reagent kits) for use in the screening and/or diagnostic assays described herein.

Standard techniques, such as the polymerase chain reaction (PCR) and DNA
hybridization, may be used to clone HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) homologs in other species, for example,
mammalian homologs. HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) homologs may be readily identified using low-stringency DNA
hybridization or low-stringency PCR with human HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) probes or primers. Degenerate
primers encoding human HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptides may be used to clone HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) homologs by RT-PCR.
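
A degenerate primer is a mixture of related oligonucleotides, conventionally
written with IUPAC ambiguity codes. The sketch below expands such a primer
into its individual sequences and counts them. The example primer is a
hypothetical one that would encode the tripeptide Trp-Asp-Lys; it is not taken
from the sequences of the specification, and the function names are
assumptions for illustration.

```python
from itertools import product

# IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer):
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for base in primer.upper():
        count *= len(IUPAC[base])
    return count


def expand_degenerate(primer):
    """List every concrete sequence represented by a degenerate primer."""
    choices = (IUPAC[base] for base in primer.upper())
    return ["".join(combo) for combo in product(*choices)]


# Hypothetical primer: TGG (Trp), GAY (Asp), AAR (Lys).
print(degeneracy("TGGGAYAAR"))
print(expand_degenerate("TGGGAYAAR"))
```

Keeping the degeneracy low (for example by choosing peptide regions rich in
residues with few codons) is a common design consideration, since each added
ambiguous position multiplies the number of sequences in the mixture.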

Alternatively, additional HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) homologs can be identified by utilizing
consensus sequence information for HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptides to search for similar polypeptides
in other species. For example, polypeptide databases for other species can be
searched for proteins with the HDAC domains described herein. Candidate
polypeptides containing such a motif can then be tested for their HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) biological activities, using
methods described herein.
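
A consensus-based search of this kind can be sketched as a pattern scan over a
collection of protein sequences. The motif pattern, protein names and
sequences below are hypothetical placeholders, not the HDAC domains of the
specification, and a real search would more likely use an alignment or
profile-based tool than a plain regular expression.

```python
import re


def find_candidates(proteins, motif_regex):
    """Return {name: (position, matched_text)} for proteins containing the motif.

    proteins: mapping of protein name to one-letter amino acid sequence.
    motif_regex: regular expression describing the consensus motif.
    position is the 0-based index of the first match in the sequence.
    """
    pattern = re.compile(motif_regex)
    hits = {}
    for name, sequence in proteins.items():
        match = pattern.search(sequence.upper())
        if match:
            hits[name] = (match.start(), match.group())
    return hits


# Hypothetical database and motif (illustration only).
database = {
    "protA": "MKTHHGDGAAL",
    "protB": "MKTAAAL",
}
print(find_candidates(database, "HH.DG"))
```

Candidates returned by such a scan would still need to be tested for
biological activity, as the text notes.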

EXPRESSION OF THE NUCLEIC ACID MOLECULES OF THE INVENTION 

Another aspect of the invention pertains to nucleic acid constructs containing
an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic
acid molecule, for example, one selected from the group consisting of SEQ ID NO:
1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, and the
complement of any of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO:
7, or SEQ ID NO: 9 (or portions thereof). Yet another aspect of the invention
pertains to HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), and HDRP(ANLS)
nucleic acid constructs containing a nucleic acid molecule encoding the amino acid
sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ
ID NO: 10. The constructs comprise a vector (e.g., an expression vector) into which
a sequence of the invention has been inserted in a sense or antisense orientation.

As used herein, the term "vector" or "construct" refers to a nucleic acid
molecule capable of transporting another nucleic acid to which it has been linked.
One type of vector is a "plasmid," which refers to a circular double stranded DNA
loop into which additional DNA segments can be ligated. Another type of vector is
a viral vector, wherein additional DNA segments can be ligated into the viral
genome. Certain vectors are capable of autonomous replication in a host cell into
which they are introduced (e.g., bacterial vectors having a bacterial origin of
replication and episomal mammalian vectors). Other vectors (e.g., non-episomal
mammalian vectors) are integrated into the genome of a host cell upon introduction
into the host cell, and thereby are replicated along with the host genome. Moreover,
certain vectors, expression vectors, are capable of directing the expression of genes
to which they are operably linked. In general, expression vectors of utility in
recombinant DNA techniques are often in the form of plasmids. However, the
invention is intended to include such other forms of expression vectors, such as viral
vectors (e.g., replication defective retroviruses, adenoviruses and adeno-associated
viruses) that serve equivalent functions.

Preferred recombinant expression vectors of the invention comprise a nucleic
acid molecule of the invention in a form suitable for expression of the nucleic acid
molecule in a host cell. This means that the recombinant expression vectors include
one or more regulatory sequences, selected on the basis of the host cells to be used
for expression, that are operably linked to the nucleic acid sequence to be
expressed. Within a recombinant expression vector, "operably linked" is intended to
mean that the nucleotide sequence of interest is linked to the regulatory sequence(s)
in a manner that allows for expression of the nucleotide sequence (e.g., in an in vitro
transcription/translation system or in a host cell when the vector is introduced into
the host cell). The term "regulatory sequence" is intended to include promoters,
enhancers and other expression control elements (e.g., polyadenylation signals).
Such regulatory sequences are described, for example, in Goeddel, Gene Expression
Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1990).
Regulatory sequences include those that direct constitutive expression of a
nucleotide sequence in many types of host cell and those that direct expression of the
nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory
sequences).

It will be appreciated by those skilled in the art that the design of the
expression vector can depend on such factors as the choice of the host cell to be
transformed and the level of expression of polypeptide desired. The expression
vectors of the invention can be introduced into host cells to thereby produce
polypeptides, including fusion polypeptides, encoded by nucleic acid molecules as
described herein.

The recombinant expression vectors of the invention can be designed for
expression of a polypeptide of the invention in prokaryotic or eukaryotic cells, e.g.,
bacterial cells, such as E. coli, insect cells (using baculovirus expression vectors),
yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel,
supra. Alternatively, the recombinant expression vector can be transcribed and
translated in vitro, for example, using T7 promoter regulatory sequences and T7
polymerase.

Another aspect of the invention pertains to host cells into which a
recombinant expression vector of the invention has been introduced. The terms
"host cell" and "recombinant host cell" are used interchangeably herein. It is
understood that such terms refer not only to the particular subject cell but also to the
progeny or potential progeny of such a cell. Because certain modifications may
occur in succeeding generations due to either mutation or environmental influences,
such progeny may not, in fact, be identical to the parent cell, but are still included
within the scope of the term as used herein.

A host cell can be any prokaryotic or eukaryotic cell. For example, a nucleic
acid molecule of the invention can be expressed in bacterial cells (e.g., E. coli),
insect cells, yeast, or mammalian cells (such as Chinese hamster ovary cells (CHO)
or COS cells, human 293T cells, HeLa cells, NIH 3T3 cells, and mouse
erythroleukemia (MEL) cells). Other suitable host cells are known to those skilled
in the art.

Vector DNA can be introduced into prokaryotic or eukaryotic cells via
conventional transformation or transfection techniques. As used herein, the terms
"transformation" and "transfection" are intended to refer to a variety of
art-recognized techniques for introducing a foreign nucleic acid molecule (e.g.,
DNA) into a host cell, including calcium phosphate or calcium chloride
co-precipitation, DEAE-dextran-mediated transfection, lipofection, or
electroporation. Suitable methods for transforming or transfecting host cells can be
found in Sambrook, et al. (supra), and other laboratory manuals.

For stable transfection of mammalian cells, it is known that, depending upon
the expression vector and transfection technique used, only a small fraction of cells
may integrate the foreign DNA into their genome. In order to identify and select
these integrants, a gene that encodes a selectable marker (e.g., for resistance to
antibiotics) is generally introduced into the host cells along with the gene of interest.
Preferred selectable markers include those that confer resistance to drugs, such as
G418, hygromycin, or methotrexate. Nucleic acid molecules encoding a selectable
marker can be introduced into a host cell on the same vector as the nucleic acid
molecule of the invention or can be introduced on a separate vector. Cells stably
transfected with the introduced nucleic acid molecule can be identified by drug
selection (e.g., cells that have incorporated the selectable marker gene will survive,
while the other cells die).

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in
culture, can be used to produce (i.e., express) a polypeptide of the invention.
Accordingly, the invention further provides methods for producing a polypeptide
using the host cells of the invention. In one embodiment, the method comprises
culturing the host cell of the invention (into which a recombinant expression vector
encoding a polypeptide of the invention has been introduced) in a suitable medium
such that the polypeptide is produced. In another embodiment, the method further
comprises isolating the polypeptide from the medium or the host cell.

The host cells of the invention can also be used to produce non-human
transgenic animals. For example, in one embodiment, a host cell of the invention is
a fertilized oocyte or an embryonic stem cell into which an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid molecule of the
invention has been introduced. Such host cells can then be used to create
non-human transgenic animals in which exogenous nucleotide sequences have been
introduced into the genome or homologous recombinant animals in which
endogenous nucleotide sequences have been altered. Such animals are useful for
studying the function and/or activity of the nucleotide sequence and polypeptide
encoded by the sequence and for identifying and/or evaluating modulators of their
activity.

As used herein, a "transgenic animal" is a non-human animal, preferably, a
mammal, more preferably, a rodent such as a rat or mouse, in which one or more of
the cells of the animal includes a transgene. Other examples of transgenic animals
include non-human primates, sheep, dogs, cows, goats, chickens, and amphibians. A
transgene is exogenous DNA that is integrated into the genome of a cell from which
a transgenic animal develops and that remains in the genome of the mature animal,
thereby directing the expression of an encoded gene product in one or more cell
types or tissues of the transgenic animal. As used herein, a "homologous
recombinant animal" is a non-human animal, preferably, a mammal, more
preferably, a mouse, in which an endogenous gene has been altered by homologous
recombination between the endogenous gene and an exogenous DNA molecule
introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior to
development of the animal.

Methods for generating transgenic animals via embryo manipulation and
microinjection, particularly animals such as mice, have become conventional in the
art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009,
U.S. Patent No. 4,873,191, and in Hogan, Manipulating the Mouse Embryo (Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., (1986)). Methods for
constructing homologous recombination vectors and homologous recombinant
animals are described further in Bradley, Current Opinion in Bio/Technology,
2:823-829 (1991) and in PCT Publication Nos. WO 90/11354, WO 91/01140, WO
92/0968, and WO 93/04169. Clones of the non-human transgenic animals described
herein can also be produced according to the methods described in Wilmut et al.,
Nature, 385:810-813 (1997) and PCT Publication Nos. WO 97/07668 and WO
97/07669.

ANTIBODIES OF THE INVENTION 

Polyclonal and/or monoclonal antibodies that selectively bind one form of an 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide but not another form of the polypeptide are also provided. Antibodies 
are also provided that bind a portion of either the variant or reference HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide that 
contains the polymorphic site or sites. 

In another aspect, the invention provides antibodies to each of the HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), and HDRP(ANLS) polypeptides and 
polypeptide fragments of the invention, e.g., having an amino acid sequence encoded 
by SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, 
or a portion thereof, or having an amino acid sequence encoded by a nucleic acid 
molecule comprising all or a portion of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 
5, SEQ ID NO: 7, or SEQ ID NO: 9 (e.g., SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 
NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10, or another variant, or portion thereof). 
The term "purified antibody" as used herein refers to immunoglobulin 
molecules and immunologically active portions of immunoglobulin molecules, i.e., 
molecules that contain an antigen binding site that selectively binds an antigen. A 
molecule that selectively binds to a polypeptide of the invention is a molecule that 
binds to that polypeptide or a fragment thereof, but does not substantially bind other 
molecules in a sample, e.g., a biological sample that naturally contains the 
polypeptide. Preferably the antibody is at least 60%, by weight, free from proteins 
and naturally occurring organic molecules with which it is naturally associated. More 
preferably, the antibody preparation is at least 75% or 90%, and most preferably, 
99%, by weight, antibody. Examples of immunologically active portions of 
immunoglobulin molecules include F(ab) and F(ab')2 fragments that can be 
generated by treating the antibody with an enzyme such as pepsin. 

The invention provides polyclonal and monoclonal antibodies that selectively 
bind to an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide of the invention. The term "monoclonal antibody" or "monoclonal 
antibody composition," as used herein, refers to a population of antibody molecules 
that contain only one species of an antigen binding site capable of immunoreacting 
with a particular epitope of a polypeptide of the invention. A monoclonal antibody 
composition thus typically displays a single binding affinity for a particular 
polypeptide of the invention with which it immunoreacts. 

Polyclonal antibodies can be prepared as described above by immunizing a 
suitable subject with a desired immunogen, e.g., an HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide of the invention or 
fragment thereof. The antibody titer in the immunized subject can be monitored 
over time by standard techniques, such as with an enzyme linked immunosorbent 
assay (ELISA) using immobilized polypeptide. If desired, the antibody molecules 
directed against the polypeptide can be isolated from the mammal (e.g., from the 
blood) and further purified by well-known techniques, such as protein A 
chromatography to obtain the IgG fraction. 

At an appropriate time after immunization, e.g., when the antibody titers are 
highest, antibody-producing cells can be obtained from the subject and used to 
prepare monoclonal antibodies by standard techniques, such as the hybridoma 
technique originally described by Kohler and Milstein, Nature, 256:495-497 (1975), 
the human B cell hybridoma technique (Kozbor et al., Immunol. Today, 4:72 
(1983)), the EBV-hybridoma technique (Cole et al., Monoclonal Antibodies and 
Cancer Therapy, Alan R. Liss, Inc., pp. 77-96 (1985)) or trioma techniques. The 
technology for producing hybridomas is well known (see generally Current Protocols 
in Immunology, Coligan et al., (eds.) John Wiley & Sons, Inc., New York, NY 
(1994)). Briefly, an immortal cell line (typically a myeloma) is fused to lymphocytes 
(typically splenocytes) from a mammal immunized with an immunogen as described 
above, and the culture supernatants of the resulting hybridoma cells are screened to 
identify a hybridoma producing a monoclonal antibody that binds a polypeptide of 
the invention. 

Any of the many well known protocols used for fusing lymphocytes and 
immortalized cell lines can be applied for the purpose of generating a monoclonal 
antibody to a polypeptide of the invention (see, e.g., Current Protocols in 
Immunology, supra; Galfre et al., (1977) Nature, 266:550-52; R.H. Kenneth, in 
Monoclonal Antibodies: A New Dimension In Biological Analyses, Plenum 
Publishing Corp., New York, New York (1980); and Lerner, Yale J. Biol. Med., 
54:387-402 (1981)). Moreover, the ordinarily skilled worker will appreciate that 
there are many variations of such methods that also would be useful. 

As an alternative to preparing monoclonal antibody-secreting hybridomas, a 
monoclonal antibody to an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), 
or HDRP(ANLS) polypeptide of the invention can be identified and isolated by 
screening a recombinant combinatorial immunoglobulin library (e.g., an antibody 
phage display library) with the polypeptide to thereby isolate immunoglobulin 
library members that bind the polypeptide. Kits for generating and screening phage 
display libraries are commercially available (e.g., the Pharmacia Recombinant Phage 
Antibody System, Catalog No. 27-9400-01; and the Stratagene SurfZAP™ Phage 
Display Kit, Catalog No. 240612). Additionally, examples of methods and reagents 
particularly amenable for use in generating and screening antibody display libraries 
can be found in, for example, U.S. Patent No. 5,223,409; PCT Publication No. WO 
92/18619; PCT Publication No. WO 91/17271; PCT Publication No. WO 92/20791; 
PCT Publication No. WO 92/15679; PCT Publication No. WO 93/01288; PCT 
Publication No. WO 92/01047; PCT Publication No. WO 92/09690; PCT 
Publication No. WO 90/02809; Fuchs et al., Bio/Technology, 9:1370-1372 (1991); 
Hay et al., Hum. Antibod. Hybridomas, 3:81-85 (1992); Huse et al., Science, 
246:1275-1281 (1989); and Griffiths et al., EMBO J., 12:725-734 (1993). 

Additionally, recombinant antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can 
be made using standard recombinant DNA techniques, are within the scope of the 
invention. Such chimeric and humanized monoclonal antibodies can be produced by 
recombinant DNA techniques known in the art. 

In general, antibodies of the invention (e.g., a monoclonal antibody) can be 
used to isolate an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide of the invention by standard techniques, such as affinity 
chromatography or immunoprecipitation. A polypeptide-specific antibody can 
facilitate the purification of natural polypeptide from cells and of recombinantly 
produced polypeptide expressed in host cells. Moreover, an antibody specific for an 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide of the invention can be used to detect the polypeptide (e.g., in a cellular 
lysate, cell supernatant, or tissue sample) in order to evaluate the abundance and 
pattern of expression of the polypeptide. 

The antibodies of the present invention can also be used diagnostically to 
monitor protein levels in tissue as part of a clinical testing procedure, e.g., to 
determine the efficacy of a given treatment regimen. Detection can be 
facilitated by coupling the antibody to a detectable substance. Examples of 
detectable substances include various enzymes, prosthetic groups, fluorescent 
materials, luminescent materials, bioluminescent materials, and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline 
phosphatase, β-galactosidase, and acetylcholinesterase; examples of suitable 
prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples 
of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein 
isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride and 
phycoerythrin; an example of a luminescent material includes luminol; examples of 
bioluminescent materials include luciferase, luciferin, and aequorin; and examples of 
suitable radioactive material include 125I, 131I, 35S, and 3H. 

DIAGNOSTIC AND SCREENING ASSAYS OF THE INVENTION 

The present invention also pertains to diagnostic assays for assessing HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) gene expression, or 
for assessing activity of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptides of the invention. In one embodiment, the assays are 
used in the context of a biological sample (e.g., blood, serum, cells, tissue) to 
thereby determine whether an individual is afflicted with a cell proliferation disease, 
an apoptotic disease, or a cell differentiation disease, or is at risk for (has a 
predisposition for or a susceptibility to) developing a cell proliferation disease, an 
apoptotic disease, or a cell differentiation disease. The invention also provides for 
prognostic (or predictive) assays for determining whether an individual is 
susceptible to developing a cell proliferation disease, an apoptotic disease, or a cell 
differentiation disease. For example, mutations in the HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid molecule can be 
assayed in a biological sample. Such assays can be used for prognostic or predictive 
purposes to thereby prophylactically treat an individual prior to the onset of 
symptoms associated with a cell proliferation disease, an apoptotic disease, or a cell 
differentiation disease. 

Another aspect of the invention pertains to assays for monitoring the 
influence of agents, or candidate compounds (e.g., drugs or other agents) on the 
nucleic acid molecule expression or biological activity of polypeptides of the 
invention, as well as to assays for identifying candidate compounds that bind to an 
HDAC9 polypeptide, an HDAC9a polypeptide, an HDAC9(ANLS) polypeptide, an 
HDAC9a(ANLS) polypeptide, or an HDRP(ANLS) polypeptide. These and other 
assays and agents are described in further detail in the following sections. 

DIAGNOSTIC ASSAYS 

HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
nucleic acid molecules, probes, primers, polypeptides, and antibodies to an HDAC9 
protein, an HDAC9a protein, an HDAC9(ANLS) protein, an HDAC9a(ANLS) protein, or an 
HDRP(ANLS) protein can be used in methods of diagnosis of a susceptibility to, or 
likelihood of having a cell proliferation disease, an apoptotic disease, or a cell 
differentiation disease, as well as in kits useful for diagnosis of a susceptibility to a 
cell proliferation disease, an apoptotic disease, or a cell differentiation disease. 

In one embodiment of the invention, diagnosis of a decreased susceptibility 
to a cell proliferation disease, an apoptotic disease, or a cell differentiation disease is 
made by detecting a polymorphism in HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS). The polymorphism can be a mutation in 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), such as the 
insertion or deletion of a single nucleotide, or of more than one nucleotide, resulting 
in a frame shift mutation; the change of at least one nucleotide, resulting in a change 
in the encoded amino acid; the change of at least one nucleotide, resulting in the 
generation of a premature stop codon; the deletion of several nucleotides, resulting 
in a deletion of one or more amino acids encoded by the nucleotides; the insertion of 
one or several nucleotides, such as by unequal recombination or gene conversion, 
resulting in an interruption of the coding sequence of the gene; duplication of all or a 
part of the gene; transposition of all or a part of the gene; or rearrangement of all or a 
part of the gene, or a change in the expression pattern of the various HDAC9 
isoforms. More than one such mutation may be present in a single nucleic acid 
molecule. 

Such sequence changes cause a mutation in the polypeptide encoded by 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS). For 
example, if the mutation is a frame shift mutation, the frame shift can result in a 
change in the encoded amino acids, and/or can result in the generation of a 
premature stop codon, causing generation of a truncated polypeptide. Alternatively, 
a polymorphism associated with a decreased susceptibility to a cell proliferation 
disease, an apoptotic disease, or a cell differentiation disease can be a synonymous 
mutation in one or more nucleotides (i.e., a mutation that does not result in a change 
in the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide). Such a polymorphism may alter sites, affect the stability or transport 
of mRNA, or otherwise affect the transcription or translation of the nucleic acid 
molecule. HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
that has any of the mutations described above is referred to herein as a "mutant 
nucleic acid molecule." 
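
By way of illustration only, the mutation categories described above can be sketched in Python. The sketch assumes a coding sequence supplied as an in-frame string whose length is a multiple of three, and uses the standard genetic code; the function names classify_substitution and classify_indel and the short example sequence are hypothetical and are not taken from the sequences of the invention.

```python
# Illustrative sketch only; names and the toy sequence are assumptions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code built in TCAG order; '*' marks a stop codon.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def classify_substitution(cds, pos, alt_base):
    """Classify a single-base change at 0-based position pos of an in-frame CDS."""
    codon_start = pos - pos % 3
    ref_codon = cds[codon_start:codon_start + 3]
    offset = pos - codon_start
    alt_codon = ref_codon[:offset] + alt_base + ref_codon[offset + 1:]
    ref_aa = CODON_TABLE[ref_codon]
    alt_aa = CODON_TABLE[alt_codon]
    if alt_aa == ref_aa:
        return "synonymous"
    if alt_aa == "*":
        return "premature stop"
    return "amino acid change"


def classify_indel(n_bases):
    """An insertion or deletion shifts the frame unless its length is a multiple of 3."""
    return "frame shift" if n_bases % 3 else "in-frame"


toy_cds = "ATGGAATGG"  # hypothetical: Met-Glu-Trp
print(classify_substitution(toy_cds, 5, "G"))  # GAA -> GAG
print(classify_substitution(toy_cds, 3, "T"))  # GAA -> TAA
print(classify_indel(1))
```

The sketch does not model effects on mRNA stability, transport, or expression pattern, which the text also contemplates.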

In a first method of diagnosing a decreased susceptibility to a cell 
proliferation disease, an apoptotic disease, or a cell differentiation disease, 
hybridization methods, such as Southern analysis, Northern analysis, or in situ 
hybridizations, can be used (see Ausubel, et al., supra). For example, a biological 
sample from a test subject (a "test sample") of genomic DNA, RNA, or cDNA, is 
obtained from an individual suspected of having, being susceptible to or predisposed 
for, or carrying a defect for, a cell proliferation disease, an apoptotic disease, or a 
cell differentiation disease (the "test individual"). The individual can be an adult, 
child, or fetus. The test sample can be from any source that contains genomic DNA, 
such as a blood sample, sample of amniotic fluid, sample of cerebrospinal fluid, or 
tissue sample from skin, muscle, buccal or conjunctival mucosa, placenta, 
gastrointestinal tract, or other organs. A test sample of DNA from fetal cells or 
tissue can be obtained by appropriate methods, such as by amniocentesis or 
chorionic villus sampling. The DNA, RNA, or cDNA sample is then examined to 
determine whether a polymorphism in HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) is present, and/or to determine which variant(s) 
encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
is present. The presence of the polymorphism or variant(s) can be indicated by 
hybridization of the gene in the genomic DNA, RNA, or cDNA to a nucleic acid 
probe. A "nucleic acid probe," as used herein, can be a DNA probe or an RNA 
probe; the nucleic acid probe can contain at least one polymorphism in HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or can contain a nucleic 
acid encoding a particular variant of HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS). The probe can be any of the nucleic acid 
molecules described above (e.g., the entire nucleic acid molecule, a fragment, a 
vector comprising the gene, a probe, or primer, etc.). 

To diagnose a decreased susceptibility to a cell proliferation disease, an 
apoptotic disease, or a cell differentiation disease, a hybridization sample is formed 
by contacting the test sample containing HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS), with at least one nucleic acid probe. A preferred 
probe for detecting mRNA or genomic DNA is a labeled nucleic acid probe capable 
of hybridizing to HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) mRNA or genomic DNA sequences described herein. The nucleic 
acid probe can be, for example, a full-length nucleic acid molecule, or a portion 
thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250, or 500 
nucleotides in length and sufficient to specifically hybridize under stringent 
conditions to appropriate mRNA or genomic DNA. For example, the nucleic acid 
probe can be all or a portion of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ 
ID NO: 7, SEQ ID NO: 9, or the complement of SEQ ID NO: 1, SEQ ID NO: 3, 
SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9; or can be a nucleic acid molecule 
encoding all or a portion of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID 
NO: 8, or SEQ ID NO: 10. Other suitable probes for use in the diagnostic assays of 
the invention are described above (see, e.g., probes and primers discussed under the 
heading, "Nucleic Acids of the Invention"). 

The hybridization sample is maintained under conditions that are sufficient to 
allow specific hybridization of the nucleic acid probe to HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS). "Specific hybridization," as 
used herein, indicates exact hybridization (e.g., with no mismatches). Specific 
hybridization can be performed under high stringency conditions or moderate 
stringency conditions, for example, as described above. In a particularly preferred 
embodiment, the hybridization conditions for specific hybridization are high 
stringency. 
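
As a purely illustrative sketch, "exact hybridization (e.g., with no mismatches)" can be modeled in Python as a perfect base-pairing test: the probe is taken to hybridize only if its reverse complement occurs in the target strand. This is a string-matching simplification that ignores temperature, salt, and other stringency parameters; the function names and the example sequences are hypothetical.

```python
# Illustrative sketch only; a string model of perfect Watson-Crick pairing.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def hybridizes_exactly(target, probe):
    """True if the probe can pair with the target strand with no mismatches."""
    return reverse_complement(probe) in target


target = "GGATCCATGGCGTTAACC"                    # hypothetical test-sample strand
matched_probe = reverse_complement("ATGGCGTT")   # complementary to part of target
mismatched_probe = "AACGACAT"                    # one base differs from matched_probe

print(hybridizes_exactly(target, matched_probe))
print(hybridizes_exactly(target, mismatched_probe))
```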

Specific hybridization, if present, is then detected using standard methods. If 
specific hybridization occurs between the nucleic acid probe and HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) in the test sample, then HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) has the 
polymorphism, or is the variant, that is present in the nucleic acid probe. More than 
one nucleic acid probe can also be used concurrently in this method. Specific 
hybridization of any one of the nucleic acid probes is indicative of a polymorphism 
in HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), or of the 
presence of a particular variant encoded by HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS), and is therefore diagnostic for a decreased 
susceptibility to a cell proliferation disease, an apoptotic disease, or a cell 
differentiation disease. 

In Northern analysis (see Current Protocols in Molecular Biology, Ausubel, 
et al., supra), the hybridization methods described above are used to identify the 
presence of a polymorphism or of a particular variant, associated with a decreased 
susceptibility to a cell proliferation disease, an apoptotic disease, or a cell 
differentiation disease. For Northern analysis, a test sample of RNA is obtained 
from the individual by appropriate means. Specific hybridization of a nucleic acid 
probe, as described above, to RNA from the individual is indicative of a 
polymorphism in HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS), or of the presence of a particular variant encoded by HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), and is therefore 
diagnostic for a decreased susceptibility to a cell proliferation disease, an apoptotic 
disease, or a cell differentiation disease. 

For representative examples of use of nucleic acid probes, see, for example, 
U.S. Patent Nos. 5,288,611 and 4,851,330. 

Alternatively, a peptide nucleic acid (PNA) probe can be used instead of a 
nucleic acid probe in the hybridization methods described above. PNA is a DNA 
mimic having a peptide-like, inorganic backbone, such as N-(2-aminoethyl)glycine 
units, with an organic base (A, G, C, T, or U) attached to the glycine nitrogen via a 
methylene carbonyl linker (see, for example, Nielsen et al., Bioconjugate Chemistry, 
5 (1994), American Chemical Society, p. 1 (1994)). The PNA probe can be 
designed to specifically hybridize to a gene having a polymorphism associated with 
a susceptibility to a cell proliferation disease, an apoptotic disease, or a cell 
differentiation disease. Hybridization of the PNA probe to HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) is diagnostic for a decreased 
susceptibility to a cell proliferation disease, an apoptotic disease, or a cell 
differentiation disease. 

In another method of the invention, mutation analysis by restriction digestion 
can be used to detect a mutant nucleic acid molecule, or nucleic acid molecules 
containing a polymorphism(s), if the mutation or polymorphism in the gene results 
in the creation or elimination of a restriction site. A test sample containing genomic 
DNA is obtained from the individual. Polymerase chain reaction (PCR) can be used 
to amplify HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
(and, if necessary, the flanking sequences) in the test sample of genomic DNA from 
the test individual. RFLP analysis is conducted as described (see Current Protocols 
in Molecular Biology, supra). The digestion pattern of the relevant DNA fragment 
indicates the presence or absence of the mutation or polymorphism in HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), and therefore 
indicates the presence or absence of this decreased susceptibility to a cell 
proliferation disease, an apoptotic disease, or a cell differentiation disease. 
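
The principle of the restriction digestion analysis described above can be sketched, for illustration only, in Python. The sketch computes the fragment lengths obtained when a linear amplified fragment is cut at every occurrence of a recognition site, so that a variant which eliminates the site yields a different digestion pattern. EcoRI (recognition site GAATTC, cutting after the first G) is used merely as a familiar example; the function name and both sequences are hypothetical.

```python
# Illustrative sketch only; sequences are toy examples, not HDAC9 sequences.
def fragment_lengths(seq, site, cut_offset):
    """Lengths of fragments after cutting a linear sequence at each site.

    cut_offset is the number of bases from the start of the recognition
    site to the cut position on the strand shown.
    """
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [end - begin for begin, end in zip(bounds, bounds[1:])]


reference = "AAAAGAATTCTTTTTT"  # contains one GAATTC site
variant = "AAAAGAGTTCTTTTTT"    # single-base change eliminates the site

print(fragment_lengths(reference, "GAATTC", 1))
print(fragment_lengths(variant, "GAATTC", 1))
```

A polymorphism that creates a site would be detected in the same way, by the appearance of additional, shorter fragments.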

Sequence analysis can also be used to detect specific polymorphisms in 
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS). A test 
sample of DNA or RNA is obtained from the test individual. PCR or other 
appropriate methods can be used to amplify the nucleic acid molecule, and/or its 
flanking sequences, if desired. The sequence of HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS), or a fragment of any of 
those nucleic acid molecules, or an HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) cDNA, or a fragment of any of those cDNAs, or 
an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) mRNA, 
or a fragment of any of those mRNAs, is determined, using standard methods. The 
sequence of the above gene, gene fragment, cDNA, cDNA fragment, mRNA, or 
mRNA fragment is compared with the known nucleic acid sequence of the nucleic 
acid molecule, cDNA (e.g., SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 
NO: 7, SEQ ID NO: 9, or a nucleic acid sequence encoding the protein of SEQ ID 
NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, or a fragment 
thereof) or mRNA, as appropriate. The presence of a polymorphism in HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) indicates that the 
individual has a decreased susceptibility to a cell proliferation disease, an apoptotic 
disease, or a cell differentiation disease. 
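
The comparison step of the sequence analysis described above can be illustrated with a minimal Python sketch. It assumes that the determined sequence and the known reference sequence are already aligned and of equal length, so that only substitutions are reported; insertions and deletions would require an alignment step that is not shown. The function name and the sequences are hypothetical.

```python
# Illustrative sketch only; assumes pre-aligned sequences of equal length.
def find_substitutions(reference, sample):
    """Return (0-based position, reference base, sample base) for each difference."""
    if len(reference) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (pos, ref_base, sample_base)
        for pos, (ref_base, sample_base) in enumerate(zip(reference, sample))
        if ref_base != sample_base
    ]


print(find_substitutions("ACGTAC", "ACTTAC"))
```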

Allele-specific oligonucleotides can also be used to detect the presence of a 
polymorphism in HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS), through the use of dot-blot hybridization of amplified 
oligonucleotides with allele-specific oligonucleotide (ASO) probes (see, for 
example, Saiki et al., Nature (London) 324:163-166 (1986)). An "allele-specific 
oligonucleotide" (also referred to herein as an "allele-specific oligonucleotide 
probe") is an oligonucleotide of approximately 10-50 base pairs, preferably 
approximately 15-30 base pairs, that specifically hybridizes to HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), and that contains a 
polymorphism associated with a decreased susceptibility to a cell proliferation 
disease, an apoptotic disease, or a cell differentiation disease. An allele-specific 
oligonucleotide probe that is specific for particular polymorphisms in HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) can be prepared, 
using standard methods (see Current Protocols in Molecular Biology, supra). 
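
For illustration only, the following Python sketch shows one way an allele-specific oligonucleotide sequence of the stated size range might be derived from a reference sequence: a window of the chosen length is taken around the polymorphic position, and the variant base is substituted at that position. The function name design_aso, the default length of 19, and the repetitive example sequence are assumptions made for the sketch; practical probe design would also consider melting temperature and secondary structure.

```python
# Illustrative sketch only; design_aso and its defaults are hypothetical.
def design_aso(reference, pos, alt_base, length=19):
    """Return an oligonucleotide of the given length carrying alt_base at pos.

    pos is a 0-based position in reference. The window is centered on pos
    where possible and shifted inward near the ends of the reference.
    """
    if not 10 <= length <= 50:
        raise ValueError("length outside the approximately 10-50 base range")
    if len(reference) < length:
        raise ValueError("reference shorter than requested oligonucleotide")
    start = max(0, min(pos - length // 2, len(reference) - length))
    window = reference[start:start + length]
    offset = pos - start
    return window[:offset] + alt_base + window[offset + 1:]


toy_reference = "ACGT" * 10          # hypothetical 40-base reference
probe = design_aso(toy_reference, 20, "T")
print(probe, len(probe))
```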

To identify polymorphisms in the gene that are associated with a decreased 
susceptibility to a cell proliferation disease, an apoptotic disease, or a cell 
differentiation disease, a test sample of DNA is obtained from the individual. PCR 
can be used to amplify all or a fragment of HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS), and its flanking sequences. The DNA 
containing the amplified HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) (or a fragment of any of those genes) is dot-blotted, using standard 
methods (see Current Protocols in Molecular Biology, supra), and the blot is 
contacted with the oligonucleotide probe. The presence of specific hybridization of 
the probe to the amplified HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) is then detected. Specific hybridization of an allele-specific 
oligonucleotide probe to DNA from the individual is indicative of a polymorphism 
in HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), and is 
therefore indicative of a decreased susceptibility to a cell proliferation disease, an 
apoptotic disease, or a cell differentiation disease. 

In another embodiment, arrays of oligonucleotide probes that are 
complementary to target nucleic acid sequence segments from an individual can be 
used to identify polymorphisms in HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS). For example, in one embodiment, an 
oligonucleotide array can be used. Oligonucleotide arrays typically comprise a 
plurality of different oligonucleotide probes that are coupled to a surface of a 
substrate in different known locations. These oligonucleotide arrays, also described 
as "GENECHIPS™," have been generally described in the art, for example, U.S. 
Patent No. 5,143,854 and PCT patent publication Nos. WO 90/15070 and 92/10092. 
These arrays can generally be produced using mechanical synthesis methods or light 
directed synthesis methods that incorporate a combination of photolithographic 
methods and solid phase oligonucleotide synthesis methods. See Fodor et al., 
Science, 251:767-777 (1991), Pirrung et al., U.S. Patent No. 5,143,854; PCT 
Publication No. WO 90/15070; Fodor et al., PCT Publication No. WO 92/10092, 
and U.S. Patent No. 5,424,186, the entire teachings of each of which are 
incorporated by reference herein. Techniques for the synthesis of these arrays using 
mechanical synthesis methods are described in, e.g., U.S. Patent No. 5,384,261, the 
entire teachings of which are incorporated by reference herein. 

Once an oligonucleotide array is prepared, a nucleic acid of interest is 
hybridized to the array and scanned for polymorphisms. Hybridization and scanning 
are generally carried out by methods described herein and also in, e.g., Published 
PCT Application Nos. WO 92/10092 and WO 95/11995, and U.S. Patent No. 
5,424,186, the entire teachings of which are incorporated by reference herein. In 
brief, a target nucleic acid sequence that includes one or more previously identified 
polymorphic markers is amplified by well known amplification techniques, e.g., 
PCR. Typically, this involves the use of primer sequences that are complementary 
to the two strands of the target sequence both upstream and downstream from the 
polymorphism. Asymmetric PCR techniques may also be used. Amplified target, 
generally incorporating a label, is then hybridized with the array under appropriate 
conditions. Upon completion of hybridization and washing of the array, the array is 
scanned to determine the position on the array to which the target sequence 
hybridizes. The hybridization data obtained from the scan is typically in the form of 
fluorescence intensities as a function of location on the array. 
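
As a simplified illustration of how fluorescence intensities as a function of array location could be interpreted, the following Python sketch assumes a hypothetical layout in which each interrogated sequence position has four probe locations, one for each possible base, and the base whose probe gives the strongest signal is called. The data format, the function names, and the intensity values are assumptions made for the sketch and do not describe any particular commercial array or scanner.

```python
# Illustrative sketch only; the scan format and the values are hypothetical.
def call_bases(scan):
    """Map each interrogated position to the base with the highest intensity.

    scan maps a 0-based sequence position to {base: fluorescence intensity}.
    """
    return {pos: max(signals, key=signals.get) for pos, signals in scan.items()}


def find_polymorphisms(scan, reference):
    """Return {position: (reference base, called base)} where the call differs."""
    calls = call_bases(scan)
    return {
        pos: (reference[pos], base)
        for pos, base in sorted(calls.items())
        if base != reference[pos]
    }


toy_scan = {
    0: {"A": 900, "C": 40, "G": 35, "T": 50},
    1: {"A": 30, "C": 60, "G": 45, "T": 880},
}
print(find_polymorphisms(toy_scan, "AC"))
```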

Although primarily described in terms of a single detection block, e.g., for 
detection of a single polymorphism, arrays can include multiple detection blocks, 
and thus be capable of analyzing multiple, specific polymorphisms. In alternate 
arrangements, it will generally be understood that detection blocks may be grouped 
within a single array or in multiple, separate arrays so that varying, optimal 
conditions may be used during the hybridization of the target to the array. For 
example, it may often be desirable to provide for the detection of those 
polymorphisms that fall within G-C rich stretches of a genomic sequence, separately 
from those falling in A-T rich segments. This allows for the separate optimization 
of hybridization conditions for each situation. 
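
The grouping of detection blocks by base composition can be sketched, for illustration only, as follows. The Python sketch computes the G-C fraction of each probe sequence and assigns the probe to a "GC-rich" or "AT-rich" group; the 0.5 cutoff, the function names, and the probe sequences are assumptions made for the sketch, not values taken from the text.

```python
# Illustrative sketch only; the 0.5 threshold is an assumed cutoff.
def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    return sum(base in "GC" for base in seq.upper()) / len(seq)


def assign_blocks(probes, threshold=0.5):
    """Group probe sequences so each group can be hybridized under its own conditions."""
    blocks = {"GC-rich": [], "AT-rich": []}
    for probe in probes:
        key = "GC-rich" if gc_fraction(probe) >= threshold else "AT-rich"
        blocks[key].append(probe)
    return blocks


print(assign_blocks(["GCGCGCAT", "ATATATGC"]))
```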

Additional descriptions of the use of oligonucleotide arrays for detection of 
polymorphisms can be found, for example, in U.S. Patent Nos. 5,858,659 and 
5,837,832, the entire teachings of which are incorporated by reference herein. 

Other methods of nucleic acid analysis can be used to detect polymorphisms 
in HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or 
variants encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS). Representative methods include direct manual sequencing (Church 
and Gilbert, Proc. Natl. Acad. Sci. USA 81: 1991-1995 (1988); Sanger et al., Proc. 
Natl. Acad. Sci. 74: 5463-5467 (1977); Beavis et al., U.S. Patent No. 5,288,644); 
automated fluorescent sequencing; single-stranded conformation polymorphism 
assays (SSCP); clamped denaturing gel electrophoresis (CDGE); denaturing gradient 
gel electrophoresis (DGGE) (Sheffield et al., Proc. Natl. Acad. Sci. USA 86: 
232-236 (1991)); mobility shift analysis (Orita et al., Proc. Natl. Acad. Sci. USA 86: 
2766-2770 (1989)); restriction enzyme analysis (Flavell et al., Cell 15: 25 (1978); 
Geever, et al., Proc. Natl. Acad. Sci. USA 78: 5081 (1981)); heteroduplex analysis; 
chemical mismatch cleavage (CMC) (Cotton et al., Proc. Natl. Acad. Sci. USA 85: 
4397-4401 (1985)); RNase protection assays (Myers et al., Science 230: 1242 
(1985)); use of polypeptides that recognize nucleotide mismatches, such as E. coli 
mutS protein; and allele-specific PCR. 

In another embodiment of the invention, diagnosis of a susceptibility to a cell 
proliferation disease, an apoptotic disease, or a cell differentiation disease can also 
be made by examining the level of an HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) nucleic acid, for example, using in situ 
5 hybridization techniques known to one skilled in the art, or by examining the level of 
expression, activity, and/or composition of an HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) polypeptide, by a variety of methods, including 
enzyme linked immunosorbent assays (ELISAs), Western blots,
immunoprecipitations, immunohistochemistry, and immunofluorescence. A test
sample from an individual is assessed for the presence of an alteration in the level of
an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic
acid or in the expression and/or an alteration in composition of the polypeptide
encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS),
or for the presence of a particular variant encoded by HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS). An alteration in expression of a
polypeptide encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) can be, for example, an alteration in the quantitative polypeptide
expression (i.e., the amount of polypeptide produced); an alteration in the 
composition of a polypeptide encoded by HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS); or an alteration in the qualitative polypeptide
expression (e.g., expression of a mutant HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide or variant thereof). In a preferred
embodiment, diagnosis of a susceptibility to a cell proliferation disease, an apoptotic 
disease, or a cell differentiation disease is made by detecting a particular variant 
encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS),
or a particular pattern of variants. Preferably, increased levels of HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or increased expression or
activity of an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide, relative to a control sample, for example, a sample
known not to be associated with a cell proliferation disease, an apoptotic disease, or
a cell differentiation disease, indicates an increased susceptibility or likelihood that 
the individual has a cell proliferation disease, an apoptotic disease, or a cell
differentiation disease. Alternatively, decreased levels of HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or decreased expression or
activity of an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide, relative to a control sample, for example, a sample
known not to be associated with a cell proliferation disease, an apoptotic disease, or
a cell differentiation disease, indicates a decreased susceptibility or likelihood that 
the individual has a cell proliferation disease, an apoptotic disease, or a cell 
differentiation disease. 

Both quantitative and qualitative alterations can also be present. An 
"alteration" or "modulation" in the polypeptide expression, activity, or composition,
as used herein, refers to an alteration in expression or composition in a test sample, 
as compared with the expression or composition of HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide in a control 
sample. A control sample is a sample that corresponds to the test sample (e.g., is
from the same type of cells), and is from an individual who is not affected by a cell
proliferation disease, an apoptotic disease, or a cell differentiation disease. An 
alteration in the expression or composition of the polypeptide in the test sample, as 
compared with the control sample, is indicative of a decreased susceptibility to a cell 
proliferation disease, an apoptotic disease, or a cell differentiation disease.
Similarly, the presence of one or more different variants in the test sample, or the
presence of significantly different amounts of different variants in the test sample, as 
compared with the control sample, is indicative of a decreased susceptibility to a cell 
proliferation disease, an apoptotic disease, or a cell differentiation disease. 

It is understood that alterations or modulations in polypeptide expression or
function can occur in varying degrees. For example, an alteration or modulation in
expression can be an increase, for example, by at least 1.5-fold to 2-fold, at least
3-fold, or at least 5-fold, relative to the control. Alternatively, the alteration or
modulation in polypeptide expression can be a decrease, for example, by at least
10%, at least 40%, 50%, or 75%, or by at least 90%, relative to the control.
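
The thresholds listed above reduce to simple arithmetic on the test and control levels. The following is a minimal illustrative sketch in Python and is not part of the described methods. The function name `describe_alteration`, the constant names, and the assumption that expression levels are non-negative measurements in the same arbitrary units are all hypothetical.

```python
# Thresholds taken from the passage above; names are illustrative only.
INCREASE_FOLDS = (1.5, 2.0, 3.0, 5.0)
DECREASE_PERCENTS = (10.0, 40.0, 50.0, 75.0, 90.0)


def describe_alteration(test_level: float, control_level: float) -> str:
    """Report the largest listed threshold met by the test level relative to control."""
    if control_level <= 0 or test_level < 0:
        raise ValueError("control must be positive and test must be non-negative")
    ratio = test_level / control_level
    if ratio >= 1.0:
        # Fold increase relative to the control.
        met = [f for f in INCREASE_FOLDS if ratio >= f]
        if met:
            return f"increase of at least {met[-1]:g}-fold"
        return "no listed threshold met"
    # Percent decrease relative to the control.
    drop = 100.0 * (control_level - test_level) / control_level
    met = [p for p in DECREASE_PERCENTS if drop >= p]
    if met:
        return f"decrease of at least {met[-1]:g}%"
    return "no listed threshold met"
```

For example, a test level of 30 against a control level of 10 (hypothetical units) would be reported as an increase of at least 3-fold.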

Various means of examining expression or composition of the HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide can be
used, including spectroscopy, colorimetry, electrophoresis, isoelectric focusing, and
immunoassay (e.g., David et al., U.S. Patent No. 4,376,110) such as
immunoblotting (see also Ausubel et al., supra; particularly chapter 10). For
example, in one embodiment, an antibody capable of binding to the polypeptide
(e.g., as described above), preferably an antibody with a detectable label, can be
used. Antibodies can be polyclonal, or more preferably, monoclonal. An intact
antibody, or a fragment thereof (e.g., Fab or F(ab')2) can be used. The term 
"labeled," with regard to the antibody, is intended to encompass direct labeling of 
the antibody by coupling (i.e., physically linking) a detectable substance to the 
antibody, as well as indirect labeling of the antibody by reacting it with another 
reagent that is directly labeled. An example of indirect labeling is detection of a
primary antibody using a fluorescently labeled secondary antibody. 

Western blotting analysis, using an antibody as described above that 
specifically binds to a mutant HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) polypeptide, or an antibody that specifically
binds to a non-mutant HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide, or an antibody that specifically binds to a particular
variant encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS), can be used to identify the presence in a test sample of a particular
variant of a polypeptide encoded by a polymorphic or mutant HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), or the absence in a test sample
of a particular variant or of a polypeptide encoded by a non-polymorphic or
non-mutant gene. The presence of a polypeptide encoded by a polymorphic or 
mutant gene, or the absence of a polypeptide encoded by a non-polymorphic or 
non-mutant gene, is diagnostic for a decreased susceptibility to a cell proliferation 
disease, an apoptotic disease, or a cell differentiation disease, as is the presence (or
absence) of particular variants encoded by the HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) nucleic acid molecule.

In one embodiment of this method, the level or amount of HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide in a test 
sample is compared with the level or amount of the HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide in a control
sample. A level or amount of the polypeptide in the test sample that is higher or
lower than the level or amount of the polypeptide in the control sample, such that the
difference is statistically significant, is indicative of an alteration in the expression of 
the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide, and is diagnostic for a decreased susceptibility to a cell proliferation 
disease, an apoptotic disease, or a cell differentiation disease.
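
The passage above requires only that the difference between test and control levels be statistically significant; it does not name a particular test. As one possible illustration, an exact two-sided permutation test on the difference in mean levels can be sketched in Python. The function name `permutation_p_value` and the replicate values in the usage note are hypothetical, and other tests may be equally or more appropriate for real data.

```python
from itertools import combinations


def permutation_p_value(test, control):
    """Exact two-sided permutation p-value for a difference in group means."""
    pooled = list(test) + list(control)
    n_test = len(test)
    n_control = len(control)
    if n_test == 0 or n_control == 0:
        raise ValueError("both groups need at least one measurement")
    total = sum(pooled)

    def abs_mean_diff(group_sum):
        # |mean of the chosen group - mean of the remaining measurements|
        return abs(group_sum / n_test - (total - group_sum) / n_control)

    observed = abs_mean_diff(sum(test))
    extreme = 0
    splits = 0
    # Enumerate every way of relabeling n_test of the pooled measurements as "test".
    for idx in combinations(range(len(pooled)), n_test):
        splits += 1
        if abs_mean_diff(sum(pooled[i] for i in idx)) >= observed - 1e-12:
            extreme += 1
    return extreme / splits
```

With four hypothetical test replicates `[8.1, 7.9, 8.4, 8.0]` and four control replicates `[5.0, 5.2, 4.9, 5.1]`, only the observed labeling and its mirror image are as extreme as the data, so the p-value is 2/70. Exhaustive enumeration is practical only for small replicate counts.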

Alternatively, the composition of the HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) polypeptide in a test sample is compared with 
the composition of the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide in a control sample. A difference in the composition of
the polypeptide in the test sample, as compared with the composition of the
polypeptide in the control sample (e.g., the presence of different variants), is 
diagnostic for a decreased susceptibility to a cell proliferation disease, an apoptotic 
disease, or a cell differentiation disease. In another embodiment, both the level or 
amount and the composition of the polypeptide can be assessed in the test sample
and in the control sample. A difference in the amount or level of the polypeptide in
the test sample, compared to the control sample; a difference in composition in the 
test sample, compared to the control sample; or both a difference in the amount or 
level, and a difference in the composition, is indicative of a decreased susceptibility 
to a cell proliferation disease, an apoptotic disease, or a cell differentiation disease. 

Kits (e.g., reagent kits) useful in the methods of diagnosis comprise
components useful in any of the methods described herein, including, for example,
hybridization probes or primers as described herein (e.g., labeled probes or primers),
reagents for detection of labeled molecules, restriction enzymes (e.g., for RFLP
analysis), allele-specific oligonucleotides, antibodies that bind to a mutant or to
non-mutant (native) HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide, means for amplification of nucleic acids comprising
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), or means
for analyzing the nucleic acid sequence of HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS), or for analyzing the amino acid sequence of an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide, etc.

SCREENING ASSAYS AND AGENTS IDENTIFIED THEREBY

The invention provides methods (also referred to herein as "screening 
assays") for identifying the presence of a nucleotide that hybridizes to a nucleic acid 
of the invention, as well as for identifying the presence of a polypeptide encoded by 
a nucleic acid of the invention. In one embodiment, the presence (or absence) of a
nucleic acid molecule of interest (e.g., a nucleic acid that has significant homology
with a nucleic acid of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS)) in a sample can be assessed by contacting the sample with a nucleic
acid comprising a nucleic acid of the invention (e.g., a nucleic acid having the
sequence of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ
ID NO: 9, which may optionally comprise at least one polymorphism, or the
complement thereof, or a nucleic acid encoding an amino acid having the sequence
of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO:
10, or a fragment or variant of such nucleic acids), under stringent conditions as
described above, and then assessing the sample for the presence (or absence) of
hybridization. In a preferred embodiment, high stringency conditions are conditions 
appropriate for selective hybridization. In another embodiment, a sample containing 
the nucleic acid molecule of interest is contacted with a nucleic acid containing a 
contiguous nucleotide sequence (e.g., a primer or a probe as described above) that is 
at least partially complementary to a part of the nucleic acid molecule of interest
(e.g., an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
nucleic acid), and the contacted sample is assessed for the presence or absence of
hybridization. In a preferred embodiment, the nucleic acid containing a contiguous
nucleotide sequence is completely complementary to a part of the nucleic acid
molecule of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS). 

In any of the above embodiments, all or a portion of the nucleic acid of 
interest can be subjected to amplification prior to performing the hybridization. 

In another embodiment, the presence (or absence) of an HDAC9, HDAC9a, 
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, such as a
polypeptide of the invention or a fragment or variant thereof, in a sample can be
assessed by contacting the sample with an antibody that specifically binds to the
polypeptide of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) (e.g., an antibody such as those described above), and then assessing
the sample for the presence (or absence) of binding of the antibody to the HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide.

In another embodiment, the invention provides methods for identifying
agents or compounds (e.g., fusion proteins, polypeptides, peptidomimetics,
prodrugs, receptors, binding agents, antibodies, small molecules or other drugs, or 
ribozymes) that alter or modulate (e.g., increase or decrease) the activity of the 
polypeptides described herein, or that otherwise interact with the polypeptides
herein. For example, such compounds can be compounds or agents that bind to
polypeptides described herein (e.g., HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) substrates or agents); that have a stimulatory or 
inhibitory effect on, for example, activity of polypeptides of the invention; or that 
change (e.g., enhance or inhibit) the ability of the polypeptides of the invention to
interact with HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) binding agents; or that alter post-translational processing of the
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide (e.g., agents that alter proteolytic processing to direct the polypeptide 
from where it is normally synthesized to another location in the cell, such as the cell
surface; or agents that alter proteolytic processing such that more polypeptide is
released from the cell, etc.). In one example, the binding agent is a cell proliferation
disease binding agent, an apoptotic disease binding agent, or a cell differentiation 
disease binding agent. As used herein, by a "cell proliferation disease binding 
agent," an "apoptotic disease binding agent," or a "cell differentiation disease
binding agent" is meant an agent as described herein that binds to a polypeptide of
the present invention and modulates a cell proliferation disease, an apoptotic disease, 
or a cell differentiation disease. The modulation can be an increase or a decrease in 
the severity or progression of the disease. In addition, a cell proliferation disease 
binding agent, an apoptotic disease binding agent, or a cell differentiation disease
binding agent includes an agent that binds to a polypeptide that is upstream (earlier)
or downstream (later) of the cell signaling events mediated by a polypeptide of the
present invention, and thereby modulates the overall activity of the signaling
pathway; in turn, the disease state is modulated. 

The candidate compound can cause an increase in the activity of the 
polypeptide. For example, the activity of the polypeptide can be increased by at least 
1.5-fold to 2-fold, at least 3-fold, or at least 5-fold, relative to the control.
Alternatively, the activity of the polypeptide can be decreased, for example, by at least
10%, at least 20%, 40%, 50%, or 75%, or by at least 90%, relative to the control. 

In one embodiment, the invention provides assays for screening candidate 
compounds or test agents to identify compounds that bind to or modulate the activity 
of polypeptides described herein (or biologically active portion(s) thereof), as well as
agents identifiable by the assays. As used herein, a "candidate compound" or "test
agent" is a chemical molecule, be it naturally-occurring or artificially-derived, and
includes, for example, peptides, proteins, synthesized molecules (for example,
synthetic organic molecules), naturally-occurring molecules (for example,
naturally-occurring organic molecules), nucleic acid molecules, and components thereof.

In general, candidate compounds for uses in the present invention may be 
identified from large libraries of natural products or synthetic (or semi-synthetic)
extracts or chemical libraries according to methods known in the art. Those skilled
in the field of drug discovery and development will understand that the precise
source of test extracts or compounds is not critical to the screening procedure(s) of
the invention. Accordingly, virtually any number of chemical extracts or compounds 
can be screened using the exemplary methods described herein. Examples of such 
extracts or compounds include, but are not limited to, plant-, fungal-, prokaryotic- or 
animal-based extracts, fermentation broths, and synthetic compounds, as well as 
modification of existing compounds. Numerous methods are also available for
generating random or directed synthesis (e.g., semi-synthesis or total synthesis) of 
any number of chemical compounds, including, but not limited to, saccharide-, 
lipid-, peptide-, and nucleic acid-based compounds. Synthetic compound libraries 
are commercially available, e.g., from Brandon Associates (Merrimack, NH) and
Aldrich Chemical (Milwaukee, WI). Alternatively, libraries of natural compounds
in the form of bacterial, fungal, plant, and animal extracts are commercially
available from a number of sources, including Biotics (Sussex, UK), Xenova
(Slough, UK), Harbor Branch Oceanographics Institute (Ft. Pierce, FL), and
PharmaMar, U.S.A. (Cambridge, MA). In addition, natural and synthetically
produced libraries are generated, if desired, according to methods known in the art, 
e.g., by standard extraction and fractionation methods. For example, candidate 
compounds can be obtained using any of the numerous approaches in combinatorial
library methods known in the art, including: biological libraries; spatially 
addressable parallel solid phase or solution phase libraries; synthetic library methods 
requiring deconvolution; the "one-bead one-compound" library method; and 
synthetic library methods using affinity chromatography selection. The biological
library approach is limited to polypeptide libraries, while the other four approaches
are applicable to polypeptide, non-peptide oligomer or small molecule libraries of 
compounds (Lam, Anticancer Drug Des., 12: 145 (1997)). Furthermore, if desired, 
any library or compound is readily modified using standard chemical, physical, or 
biochemical methods. 

In addition, those skilled in the art of drug discovery and development
readily understand that methods for dereplication (e.g., taxonomic dereplication,
biological dereplication, and chemical dereplication, or any combination thereof) or 
the elimination of replicates or repeats of materials already known for their activities 
should be employed whenever possible. 

When a crude extract is found to modulate (i.e., stimulate or inhibit) the
expression and/or activity of the nucleic acids and/or polypeptides of the present
invention, further fractionation of the positive lead extract is necessary to isolate 
chemical constituents responsible for the observed effect. Thus, the goal of the 
extraction, fractionation, and purification process is the careful characterization and
identification of a chemical entity within the crude extract having an activity that
stimulates or inhibits nucleic acid expression, polypeptide expression, or polypeptide 
biological activity. The same assays described herein for the detection of activities 
in mixtures of compounds can be used to purify the active component and to test 
derivatives thereof. Methods of fractionation and purification of such heterogeneous
extracts are known in the art. If desired, compounds shown to be useful agents for
treatment are chemically modified according to methods known in the art. 
Compounds identified as being of therapeutic value may be subsequently analyzed
using animal models for diseases in which it is desirable to alter the activity or
expression of the nucleic acids or polypeptides of the present invention. 

In one embodiment, to identify candidate compounds that alter the biological 
activity, for example, the enzymatic activity or transcriptional repression activity of 
an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide, a cell, tissue, cell lysate, tissue lysate, or solution containing or
expressing an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide (e.g., SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ
ID NO: 8, SEQ ID NO: 10, or another variant encoded by HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)), or a fragment or derivative
thereof (as described above), can be contacted with a candidate compound to be 
tested under conditions suitable for enzymatic reaction or transcriptional repression 
reaction, as described herein. 

Alternatively, the polypeptide can be contacted directly with the candidate 
compound to be tested. The level (amount) of HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) biological activity is assessed (e.g., the level 
(amount) of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) biological activity is measured, either directly or indirectly), and is 
compared with the level of biological activity in a control (i.e., the level of activity 
of the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide or active fragment or derivative thereof in the absence of the candidate 
compound to be tested, or in the presence of the candidate compound vehicle only). 
If the level of the biological activity in the presence of the candidate compound 
differs, by an amount that is statistically significant, from the level of the biological 
activity in the absence of the candidate compound, or in the presence of the
candidate compound vehicle only, then the candidate compound is a compound that
alters the biological activity of an HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) polypeptide. For example, an increase in the 
level of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
enzymatic or transcriptional repression activity relative to a control indicates that
the candidate compound is a compound that enhances (is an agonist of) HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) activity. Similarly,
a decrease in the enzymatic level or transcriptional repression level of HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) activity relative to a 
control, indicates that the candidate compound is a compound that inhibits (is an 
antagonist of) HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) activity. In another embodiment, the level of biological activity of an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide or derivative or fragment thereof in the presence of the candidate 
compound to be tested, is compared with a control level that has previously been 
established. A level of the biological activity in the presence of the candidate 
compound that differs from the control level by an amount that is statistically
significant indicates that the compound alters HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) biological activity. 
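
The decision rule in the preceding paragraph can be summarized in a short Python sketch. This is an illustration only, not part of the described assays: the function name `classify_candidate`, the use of a p-value as the measure of statistical significance, and the default cutoff of 0.05 are all assumptions that do not appear in the text.

```python
def classify_candidate(activity_with_compound: float,
                       activity_with_vehicle: float,
                       p_value: float,
                       alpha: float = 0.05) -> str:
    """Label a candidate compound from activity measured with and without it.

    activity_with_vehicle is the control: activity in the absence of the
    compound, or in the presence of the compound vehicle only.
    """
    if not 0.0 <= p_value <= 1.0:
        raise ValueError("p_value must lie between 0 and 1")
    # A difference that is not statistically significant is not an alteration.
    if p_value >= alpha or activity_with_compound == activity_with_vehicle:
        return "no significant alteration"
    if activity_with_compound > activity_with_vehicle:
        return "agonist"      # activity enhanced relative to control
    return "antagonist"       # activity inhibited relative to control
```

For instance, a hypothetical activity of 40 units with the compound against 100 units with vehicle only, at p = 0.01, would be labeled an antagonist.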

The present invention also relates to an assay for identifying compounds that 
alter the expression of an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) nucleic acid molecule (e.g., antisense nucleic acids, fusion proteins,
polypeptides, peptidomimetics, prodrugs, receptors, binding agents, antibodies, small
molecules or other drugs, or ribozymes) that alter (e.g., increase or decrease)
expression (e.g., transcription or translation) of the nucleic acid molecule or that
otherwise interact with the nucleic acids described herein, as well as compounds
identifiable by the assays. For example, a solution containing a nucleic acid
encoding an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) polypeptide can be contacted with a candidate compound to be tested. 
The solution can comprise, for example, cells containing the nucleic acid or cell 
lysate containing the nucleic acid; alternatively, the solution can be another solution 
that comprises elements necessary for transcription/translation of the nucleic acid.
Cells not suspended in solution can also be employed, if desired. The level and/or
pattern of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
expression (e.g., the level and/or pattern of mRNA or of protein expressed, such as
the level and/or pattern of different variants) is assessed, and is compared with the
level and/or pattern of expression in a control (i.e., the level and/or pattern of
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) expression in
the absence of the candidate compound, or in the presence of the candidate
compound vehicle only). If the level and/or pattern in the presence of the candidate
compound differs, by an amount or in a manner that is statistically significant, from 
the level and/or pattern in the absence of the candidate compound, or in the presence 
of the candidate compound vehicle only, then the candidate compound is a 
compound that alters the expression of HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS). Enhancement of HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) expression indicates that the
candidate compound is an agonist of HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) activity. Similarly, inhibition of HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) expression indicates
that the candidate compound is an antagonist of HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) activity. In another embodiment, the level 
and/or pattern of an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide(s) (e.g., different variants) in the presence of the
candidate compound to be tested, is compared with a control level and/or pattern that
has previously been established. A level and/or pattern in the presence of the 
candidate compound that differs from the control level and/or pattern by an amount 
or in a manner that is statistically significant indicates that the candidate compound 
alters HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
expression.

In another embodiment of the invention, compounds that alter the expression
of an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
nucleic acid molecule or that otherwise interact with the nucleic acids described
herein, can be identified using a cell, cell lysate, or solution containing a nucleic
acid encoding the promoter region of the HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) gene operably linked to a reporter gene. After 
contact with a candidate compound to be tested, the level of expression of the 
reporter gene (e.g., the level of mRNA or of protein expressed) is assessed, and is
compared with the level of expression in a control (i.e., the level of the expression
of the reporter gene in the absence of the candidate compound, or in the presence of
the candidate compound vehicle only). If the level in the presence of the candidate 
compound differs, by an amount or in a manner that is statistically significant, from
the level in the absence of the candidate compound, or in the presence of the
candidate compound vehicle only, then the candidate compound is a compound that
alters the expression of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS), as indicated by its ability to alter expression of a gene that is
operably linked to the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) gene promoter. Enhancement of the expression of the reporter 
indicates that the compound is an agonist of HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) activity. Similarly, inhibition of the expression 
of the reporter indicates that the compound is an antagonist of HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) activity. In another
embodiment, the level of expression of the reporter in the presence of the candidate
compound to be tested, is compared with a control level that has previously been 
established. A level in the presence of the candidate compound that differs from the 
control level by an amount or in a manner that is statistically significant indicates
that the candidate compound alters HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) expression.

Compounds that alter the amounts of different variants encoded by HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) (e.g., a compound
that enhances activity of a first variant, and that inhibits activity of a second variant),
as well as compounds that are agonists of activity of a first variant and antagonists
of activity of a second variant, can easily be identified using these methods 
described above. 

In other embodiments of the invention, assays can be used to assess the 
impact of a candidate compound on the activity of a polypeptide in relation to an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrate,
for example, an inhibitor of histone deacetylase activity. These inhibitors fall into 
four general classes: 1) short-chain fatty acids (e.g., 4-phenylbutyrate and valproic 
acid); 2) hydroxamic acids (e.g., SAHA, Pyroxamide, trichostatin A (TSA), 
oxamflatin and CHAPs, such as CHAP1 and CHAP31); 3) cyclic tetrapeptides
(Trapoxin A, Apicidin and Depsipeptide (FK-228, also known as FR901228)); 4)
benzamides (e.g., MS-275); and other compounds such as Scriptaid. Examples of
such assays and compounds can be found in U.S. Patent Nos. 5,369,108, issued on
November 29, 1994, 5,700,811, issued on December 23, 1997, and 5,773,474,
issued on June 30, 1998 to Breslow et al., U.S. Patent Nos. 5,055,608, issued on
October 8, 1991, and 5,175,191, issued on December 29, 1992 to Marks et al., as
well as Yoshida et al., supra; Saito et al., supra; Furamai et al., supra; Komatsu et
al., supra; Su et al., supra; Lee et al., supra and Suzuki et al., supra, the entire
contents of all of which are hereby incorporated by reference.

In one example, a cell or tissue that expresses or contains a compound that
interacts with HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or 
HDRP(ANLS) (herein referred to as an "HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) substrate," which can be a polypeptide or other
molecule that interacts with HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), 
or HDRP(ANLS)) is contacted with HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) in the presence of a candidate compound, and 
the ability of the candidate compound to alter the interaction between HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) and the HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrate is
determined, for example, by assaying activity of the polypeptide. Alternatively, a 
cell lysate or a solution containing the HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) substrate, can be used. A compound that binds 
to HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or the
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrate 
can alter the interaction by interfering with, or enhancing the ability of HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) to bind to, associate 
with, or otherwise interact with the HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) substrate.

Determining the ability of the candidate compound to bind to HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or an HDAC9, 
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrate can be 
accomplished, for example, by coupling the candidate compound with a 
30 radioisotope or enzymatic label such that binding of the candidate compound to the 
polypeptide can be determined by detecting the labeled with 12S I, 35 S, 14 C, or 3 H, 
either directly or indirectly, and the radioisotope detected by direct counting of 
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radioemmission or by scintillation counting. Alternatively, candidate compound can 
be enzymatically labeled with, for example, horseradish peroxidase, alkaline 
phosphatase, or luciferase, and the enzymatic label detected by determination of 
conversion of an appropriate substrate to product. 

It is also within the scope of this invention to determine the ability of a
candidate compound to interact with the polypeptide without the labeling of any of
the interactants. For example, a microphysiometer can be used to detect the
interaction of a candidate compound with HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) or an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) substrate without the labeling of either the
candidate compound, HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS), or the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) substrate (McConnell et al., (1992) Science, 257: 1906-1912). As
used herein, a "microphysiometer" (e.g., CYTOSENSOR™) is an analytical
instrument that measures the rate at which a cell acidifies its environment using a
light-addressable potentiometric sensor (LAPS). Changes in this acidification rate
can be used as an indicator of the interaction between ligand and polypeptide.

In another embodiment of the invention, assays can be used to identify
polypeptides that interact with one or more HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptides, as described herein. For example,
a yeast two-hybrid system such as that described by Fields and Song (Fields and
Song, Nature 340: 245-246 (1989)) can be used to identify polypeptides that interact
with one or more HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptides. In such a yeast two-hybrid system, vectors are
constructed based on the flexibility of a transcription factor that has two functional
domains (a DNA binding domain and a transcription activation domain). If the two
domains are separated but fused to two different proteins that interact with one
another, transcriptional activation can be achieved, and transcription of specific
markers (e.g., nutritional markers such as His and Ade, or color markers such as
lacZ) can be used to identify the presence of interaction and transcriptional
activation. For example, in the methods of the invention, a first vector is used that
includes a nucleic acid encoding a DNA binding domain and an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, variant, or
fragment or derivative thereof, and a second vector is used that includes a nucleic
acid encoding a transcription activation domain and a nucleic acid encoding a
polypeptide that potentially may interact with the HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, variant, or
fragment or derivative thereof (e.g., an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide substrate or receptor). Incubation
of yeast containing the first vector and the second vector under appropriate
conditions (e.g., mating conditions such as used in the MATCHMAKER™ system
from Clontech) allows identification of colonies that express the markers of
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS). These
colonies can be examined to identify the polypeptide(s) that interact with the
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide or fragment or derivative thereof. Such polypeptides may be useful as
compounds that alter the activity or expression of an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, as described
above.

In more than one embodiment of the above assay methods of the present
invention, it may be desirable to immobilize an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, or an HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrate, or other
components of the assay on a solid support, in order to facilitate separation of
complexed from uncomplexed forms of one or both of the polypeptides, as well as
to accommodate automation of the assay. Binding of a candidate compound to the
polypeptide, or interaction of the polypeptide with a substrate in the presence and
absence of a candidate compound, can be accomplished in any vessel suitable for
containing the reactants. Examples of such vessels include microtitre plates, test
tubes, and micro-centrifuge tubes. In one embodiment, a fusion protein (e.g., a
glutathione-S-transferase fusion protein) can be provided that adds a domain that
allows HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or
an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
substrate to be bound to a matrix or other solid support.

In another embodiment, modulators of expression of nucleic acid molecules
of the invention are identified in a method wherein a cell, cell lysate, tissue, tissue
lysate, or solution containing a nucleic acid encoding HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) is contacted with a candidate
compound and the expression of appropriate mRNA or polypeptide (e.g., variant(s))
in the cell, cell lysate, tissue, tissue lysate, or solution is determined. The level
of expression of appropriate mRNA or polypeptide(s) in the presence of the
candidate compound is compared to the level of expression of mRNA or
polypeptide(s) in the absence of the candidate compound, or in the presence of the
candidate compound vehicle only. The candidate compound can then be identified
as a modulator of expression based on this comparison. For example, when
expression of mRNA or polypeptide is greater (statistically significantly greater) in
the presence of the candidate compound than in its absence, the candidate
compound is identified as a stimulator or enhancer of the mRNA or polypeptide
expression. Alternatively, when expression of the mRNA or polypeptide is less
(statistically significantly less) in the presence of the candidate compound than in its
absence, the candidate compound is identified as an inhibitor of the mRNA or
polypeptide expression. The level of mRNA or polypeptide expression in the cells
can be determined by methods described herein for detecting mRNA or polypeptide.
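
By way of non-limiting illustration only, the comparison of expression levels described above can be sketched in Python. The sketch is not part of the described methods: the function names, the use of a permutation test as the measure of statistical significance, the 0.05 threshold, and the example measurements are all assumptions introduced here for illustration.

```python
import random
from statistics import mean


def permutation_p_value(treated, control, n_permutations=5000, seed=0):
    """Two-sided permutation test on the difference of group means.

    treated / control: expression measurements (e.g., normalized mRNA levels)
    in the presence and absence of the candidate compound (hypothetical data).
    """
    rng = random.Random(seed)
    observed = abs(mean(treated) - mean(control))
    pooled = list(treated) + list(control)
    n_treated = len(treated)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)
        diff = abs(mean(pooled[:n_treated]) - mean(pooled[n_treated:]))
        if diff >= observed:
            hits += 1
    # The add-one correction keeps the estimate from being exactly zero.
    return (hits + 1) / (n_permutations + 1)


def classify_modulator(treated, control, alpha=0.05):
    """Label a candidate compound from expression with and without it."""
    if permutation_p_value(treated, control) >= alpha:
        return "no significant effect"
    return "stimulator" if mean(treated) > mean(control) else "inhibitor"


if __name__ == "__main__":
    # Hypothetical replicate measurements, in arbitrary units.
    with_compound = [2.1, 2.3, 2.0, 2.2, 2.4, 2.2]
    vehicle_only = [1.0, 1.1, 0.9, 1.0, 1.2, 1.1]
    print(classify_modulator(with_compound, vehicle_only))
```

A permutation test is used here only because it needs no library beyond the Python standard library; any accepted test of statistical significance could be substituted.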

This invention further pertains to novel compounds identified by the
above-described screening assays. Accordingly, it is within the scope of this
invention to further use a compound identified as described herein in an appropriate
animal model. For example, a compound identified as described herein (e.g., a
candidate compound that is a modulating compound such as an antisense nucleic
acid molecule, a specific antibody, or a polypeptide substrate) can be used in an
animal model to determine the efficacy, toxicity, or side effects of treatment with
such a compound. Alternatively, a compound identified as described herein can be
used in an animal model to determine the mechanism of action of such a compound.
Furthermore, this invention pertains to uses of novel compounds identified by the
above-described screening assays for treatments as described herein. In addition, a
compound identified as described herein can be used to alter activity of an HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, or to
alter expression of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS), by contacting the polypeptide or the nucleic acid molecule (or
contacting a cell comprising the polypeptide or the nucleic acid molecule) with the
compound identified as described herein.

PHARMACEUTICAL COMPOSITIONS 

The present invention also pertains to pharmaceutical compositions
comprising nucleic acids described herein, particularly nucleotides encoding the
polypeptides described herein; comprising polypeptides described herein (e.g., SEQ
ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, and/or
other variants encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS)); and/or comprising a compound that alters (e.g., increases or
decreases) HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
expression or HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide activity as described herein. For instance, a polypeptide,
protein, fragment, fusion protein or prodrug thereof, or a nucleotide or nucleic acid
construct (vector) comprising a nucleotide of the present invention, a compound that
alters HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide activity, a compound that alters HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) nucleic acid expression, or an HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrate or binding
partner, can be formulated with a physiologically acceptable carrier or excipient to
prepare a pharmaceutical composition. The carrier and composition can be sterile.
The formulation should suit the mode of administration.

Suitable pharmaceutically acceptable carriers include but are not limited to
water, salt solutions (e.g., NaCl), saline, buffered saline, alcohols, glycerol, ethanol,
gum arabic, vegetable oils, benzyl alcohols, polyethylene glycols, gelatin,
carbohydrates such as lactose, amylose or starch, dextrose, magnesium stearate, talc,
silicic acid, viscous paraffin, perfume oil, fatty acid esters, hydroxymethylcellulose,
polyvinyl pyrrolidone, etc., as well as combinations thereof. The pharmaceutical
preparations can, if desired, be mixed with auxiliary agents, e.g., lubricants,
preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic
pressure, buffers, coloring, flavoring and/or aromatic substances and the like that do
not deleteriously react with the active compounds.

The composition, if desired, can also contain minor amounts of wetting or 
emulsifying agents, or pH buffering agents. The composition can be a liquid 
solution, suspension, emulsion, tablet, pill, capsule, sustained release formulation,
or powder. The composition can be formulated as a suppository, with traditional
binders and carriers such as triglycerides. Oral formulation can include standard
carriers such as pharmaceutical grades of mannitol, lactose, starch, magnesium
stearate, polyvinyl pyrrolidone, sodium saccharine, cellulose, magnesium carbonate,
etc.

Methods of introduction of these compositions include, but are not limited 
to, intradermal, intramuscular, intraperitoneal, intraocular, intravenous, 
subcutaneous, topical, oral and intranasal. Other suitable methods of introduction 
can also include gene therapy (as described below), rechargeable or biodegradable
devices, particle acceleration devices ("gene guns") and slow release polymeric
devices. The pharmaceutical compositions of this invention can also be 
administered as part of a combinatorial therapy with other compounds. 

The composition can be formulated in accordance with routine
procedures as a pharmaceutical composition adapted for administration to human
beings. For example, compositions for intravenous administration typically are
solutions in sterile isotonic aqueous buffer. Where necessary, the composition may
also include a solubilizing agent and a local anesthetic to ease pain at the site of the
injection. Generally, the ingredients are supplied either separately or mixed together
in unit dosage form, for example, as a dry lyophilized powder or water free
concentrate in a hermetically sealed container such as an ampule or sachet
indicating the quantity of active compound. Where the composition is to be
administered by infusion, it can be dispensed with an infusion bottle containing
sterile pharmaceutical grade water, saline or dextrose/water. Where the composition
is administered by injection, an ampule of sterile water for injection or saline can be
provided so that the ingredients may be mixed prior to administration.

For topical application, nonsprayable forms, viscous to semi-solid or solid
forms comprising a carrier compatible with topical application and having a
dynamic viscosity preferably greater than that of water, can be employed. Suitable
formulations include but are not limited to solutions, suspensions, emulsions,
creams, ointments, powders, enemas, lotions, sols, liniments, salves, aerosols, etc.,
that are, if desired, sterilized or mixed with auxiliary agents, e.g., preservatives,
stabilizers, wetting agents, buffers or salts for influencing osmotic pressure, etc.
The compound may be incorporated into a cosmetic formulation. For topical
application, also suitable are sprayable aerosol preparations wherein the active
ingredient, preferably in combination with a solid or liquid inert carrier material, is
packaged in a squeeze bottle or in admixture with a pressurized volatile, normally
gaseous propellant, e.g., pressurized air.

Compounds described herein can be formulated as neutral or salt forms. 
Pharmaceutically acceptable salts include those formed with free amino groups such 
as those derived from hydrochloric, phosphoric, acetic, oxalic, tartaric acids, etc., 
and those formed with free carboxyl groups such as those derived from sodium, 
potassium, ammonium, calcium, ferric hydroxides, isopropylamine, triethylamine,
2-ethylamino ethanol, histidine, procaine, etc. 

The compounds are administered in a therapeutically effective amount. The 
amount of compounds that will be therapeutically effective in the treatment of a 
particular disorder or condition will depend on the nature of the disorder or 
condition, and can be determined by standard clinical techniques. In addition, in
vitro or in vivo assays may optionally be employed to help identify optimal dosage 
ranges. The precise dose to be employed in the formulation will also depend on the 
route of administration, and the seriousness of the symptoms of a cell proliferation 
disease, an apoptotic disease, or a cell differentiation disease, and should be decided 
according to the judgment of a practitioner and each patient's circumstances.
Effective doses may be extrapolated from dose-response curves derived from in
vitro or animal model test systems. 
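
By way of non-limiting illustration only, reading a dose off a measured dose-response curve can be sketched in Python. The sketch assumes, purely for illustration, that response is approximately linear in the logarithm of dose between neighboring measurements; the function name and the example data are hypothetical, and the sketch interpolates within the measured range rather than extrapolating beyond it.

```python
import math


def dose_for_response(doses, responses, target):
    """Estimate the dose giving `target` response by log-linear interpolation.

    doses: positive dose values (hypothetical, arbitrary units).
    responses: measured response at each dose (e.g., percent inhibition).
    Raises ValueError if the target lies outside the measured responses.
    """
    points = sorted(zip(doses, responses))
    for (d_lo, r_lo), (d_hi, r_hi) in zip(points, points[1:]):
        if r_lo != r_hi and min(r_lo, r_hi) <= target <= max(r_lo, r_hi):
            frac = (target - r_lo) / (r_hi - r_lo)
            log_dose = math.log10(d_lo) + frac * (
                math.log10(d_hi) - math.log10(d_lo)
            )
            return 10 ** log_dose
    raise ValueError("target response is outside the measured range")


if __name__ == "__main__":
    # Hypothetical in vitro data: dose (arbitrary units) vs. percent response.
    doses = [1, 10, 100]
    responses = [10, 50, 90]
    print(dose_for_response(doses, responses, 70))
```

Such an estimate is only a starting point; as stated above, the dose actually employed remains a matter for the judgment of a practitioner.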

The invention also provides a pharmaceutical pack or kit comprising one or 
more containers filled with one or more of the ingredients of the pharmaceutical 
compositions of the invention. Optionally associated with such container(s) can be
a notice in the form prescribed by a governmental agency regulating the 
manufacture, use or sale of pharmaceuticals or biological products, that notice
reflects approval by the agency of manufacture, use or sale for human
administration. The pack or kit can be labeled with information regarding mode of 
administration, sequence of drug administration (e.g., separately, sequentially or 
concurrently), or the like. The pack or kit may also include means for reminding the 
patient to take the therapy. The pack or kit can be a single unit dosage of the
combination therapy or it can be a plurality of unit dosages. In particular, the 
compounds can be separated, mixed together in any combination, present in a single 
vial or tablet. Compounds assembled in a blister pack or other dispensing means are
preferred. For the purpose of this invention, unit dosage is intended to mean a
dosage that is dependent on the individual pharmacodynamics of each compound
and administered in FDA approved dosages in standard time courses. 



METHODS OF THERAPY 

The present invention also pertains to methods of treatment (prophylactic,
diagnostic, and/or therapeutic) for a cell proliferation disease, an apoptotic disease,
or a cell differentiation disease, using an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) therapeutic compound. An "HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) therapeutic
compound" is a compound that alters (e.g., enhances or inhibits) HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide activity and/or
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid
molecule expression, as described herein (e.g., an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) agonist or antagonist).
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
therapeutic compounds can alter HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide activity or nucleic acid molecule
expression by a variety of means, such as, for example, by providing additional
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide or by upregulating the transcription or translation of the HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid
molecule; by altering post-translational processing of the HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide; by altering
transcription of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) variants; or by interfering with HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide activity (e.g., by binding to an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide), or by downregulating the transcription or translation of the HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid
molecule. Representative HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS),
or HDRP(ANLS) therapeutic compounds include the following: nucleic acids or
fragments or derivatives thereof described herein, particularly nucleotides encoding
the polypeptides described herein and vectors comprising such nucleic acids (e.g., a
nucleic acid molecule, cDNA and/or RNA, such as a nucleic acid encoding an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide or active fragment or derivative thereof, or an oligonucleotide; for
example, SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID
NO: 9, which may optionally comprise at least one polymorphism, or a nucleic acid
encoding SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID
NO: 10, or fragments or derivatives thereof); polypeptides described herein (e.g.,
SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10
and/or other variants encoded by HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS), or fragments or derivatives thereof); HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrates;
peptidomimetics; fusion proteins or prodrugs thereof; antibodies (e.g., an antibody
to a mutant HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide, or an antibody to a non-mutant HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, or an antibody to
a particular variant encoded by HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS), as described above); ribozymes; other small
molecules; and other compounds that alter (e.g., enhance or inhibit) HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid
expression or polypeptide activity, for example, those compounds identified in the
screening methods described herein, or that regulate transcription of HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) variants (e.g.,
compounds that affect which variants are expressed, or that affect the amount of
each variant that is expressed). More than one HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) therapeutic compound can be used
concurrently, if desired.

The HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) therapeutic compound that is a nucleic acid is used in the treatment
of a cell proliferation disease, an apoptotic disease, or a cell differentiation disease.
The term "treatment," as used herein, refers not only to ameliorating symptoms
associated with the disease, but also to preventing or delaying the onset of the
disease, and also to lessening the severity or frequency of symptoms of the disease.
The therapy is designed to alter (e.g., inhibit or enhance), replace or supplement
activity of an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide in an individual. For example, an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) therapeutic compound can be administered in
order to upregulate or increase the expression or availability of the HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid molecule
or of specific variants of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS), or, conversely, to downregulate or decrease the expression or
availability of the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) nucleic acid molecule or specific variants of HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS). Upregulation or increasing
expression or availability of a native HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) nucleic acid molecule or of a particular variant
could interfere with or compensate for the expression or activity of a defective gene
or another variant; downregulation or decreasing expression or availability of a
native HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
nucleic acid molecule or of a particular variant could minimize the expression or
activity of a defective gene or the particular variant and thereby minimize the impact
of the defective gene or the particular variant.

The HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) therapeutic compound(s) are administered in a therapeutically
effective amount (i.e., an amount that is sufficient to treat the disease, such as by
ameliorating symptoms associated with the disease, preventing or delaying the onset
of the disease, and/or also lessening the severity or frequency of symptoms of the
disease). The amount that will be therapeutically effective in the treatment of a
particular individual's disorder or condition will depend on the symptoms and
severity of the disease, and can be determined by standard clinical techniques. In
addition, in vitro or in vivo assays may optionally be employed to help identify
optimal dosage ranges. The precise dose to be employed in the formulation will
also depend on the route of administration, and the seriousness of the disease or
disorder, and should be decided according to the judgment of a practitioner and each
patient's circumstances. Effective doses may be extrapolated from dose-response
curves derived from in vitro or animal model test systems.

In one embodiment, a nucleic acid of the invention (e.g., a nucleic acid
encoding an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide, such as SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5,
SEQ ID NO: 7, or SEQ ID NO: 9, which may optionally comprise at least one
polymorphism, or a nucleic acid that encodes an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide or a variant,
derivative or fragment thereof, such as a nucleic acid encoding the protein of SEQ
ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10) can
be used, either alone or in a pharmaceutical composition as described above. For
example, an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
gene or a cDNA encoding an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide, either by itself or included within a vector, can be
introduced into cells (either in vitro or in vivo) such that the cells produce native
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide. If desired, cells that have been transformed with the gene or cDNA or
a vector comprising the gene or cDNA can be introduced (or re-introduced) into an
individual affected with the disease. Thus, cells that, in nature, lack native HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) expression and
activity, or have mutant HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) expression and activity, or have expression of a disease-associated
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) variant,
can be engineered to express an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide or an active fragment of an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide (or a different variant of an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide). In a preferred embodiment,
nucleic acid encoding the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide, or an active fragment or derivative thereof, can be
introduced into an expression vector, such as a viral vector, and the vector can be
introduced into appropriate cells in an animal. Other gene transfer systems,
including viral and nonviral transfer systems, can be used. Alternatively, nonviral
gene transfer methods, such as calcium phosphate coprecipitation; mechanical
techniques (e.g., microinjection); membrane fusion-mediated transfer via liposomes;
or direct DNA uptake, can also be used to introduce the desired nucleic acid
molecule into a cell.

Alternatively, in another embodiment of the invention, a nucleic acid of the
invention; a nucleic acid complementary to a nucleic acid of the invention; or a
portion of such a nucleic acid (e.g., an oligonucleotide as described below), can be
used in "antisense" therapy, in which a nucleic acid (e.g., an oligonucleotide) that
specifically hybridizes to the RNA and/or genomic DNA of HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) is administered or generated in
situ. The antisense nucleic acid that specifically hybridizes to the RNA and/or DNA
inhibits expression of the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) nucleic acid molecule, e.g., by inhibiting translation and/or
transcription. Binding of the antisense nucleic acid can be by conventional base pair
complementarity, or, for example, in the case of binding to DNA duplexes, through
specific interaction in the major groove of the double helix.

An antisense construct of the present invention can be delivered, for
example, as an expression plasmid as described above. When the plasmid is
transcribed in the cell, it produces RNA that is complementary to a portion of the
mRNA and/or DNA that encodes an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide. Alternatively, the antisense
construct can be an oligonucleotide probe which is generated ex vivo and introduced
into cells; it then inhibits expression by hybridizing with the mRNA and/or genomic
DNA of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS). In
one embodiment, the oligonucleotide probes are modified oligonucleotides that are
resistant to endogenous nucleases, e.g., exonucleases and/or endonucleases, thereby
rendering them stable in vivo. Exemplary nucleic acid molecules for use as
antisense oligonucleotides are phosphoramidate, phosphorothioate and
methylphosphonate analogs of DNA (see also U.S. Patent Nos. 5,176,996;
5,264,564; and 5,256,775). Additionally, general approaches to constructing
oligomers useful in antisense therapy are also described, for example, by Van der
Krol et al., Biotechniques 6: 958-976 (1988); and Stein et al., Cancer Res 48:
2659-2668 (1988). With respect to antisense DNA, oligodeoxyribonucleotides
derived from the translation initiation site, e.g., between the -10 and +10 regions of
an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic
acid sequence, are preferred.
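
By way of non-limiting illustration only, deriving an antisense oligodeoxyribonucleotide from the region around a translation initiation site can be sketched in Python. The sketch assumes the common numbering convention in which +1 is the A of the ATG start codon and there is no position 0, so that the -10 to +10 region spans 20 nucleotides. The function names are hypothetical, and the example sequence is invented for illustration and is not taken from any SEQ ID NO herein.

```python
# Watson-Crick complements for DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence, written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def antisense_for_start_region(coding_strand, atg_index, upstream=10, downstream=10):
    """Antisense DNA oligo against positions -upstream..+downstream.

    coding_strand: sense-strand DNA (same sequence as the mRNA, T for U).
    atg_index: 0-based index of the A in the ATG start codon (position +1).
    """
    coding_strand = coding_strand.upper()
    if coding_strand[atg_index:atg_index + 3] != "ATG":
        raise ValueError("atg_index does not point at an ATG start codon")
    start = atg_index - upstream
    end = atg_index + downstream
    if start < 0 or end > len(coding_strand):
        raise ValueError("sequence is too short for the requested region")
    return reverse_complement(coding_strand[start:end])


if __name__ == "__main__":
    # Invented example sequence; the ATG start codon begins at index 11.
    example = "TTGCCGCCACCATGGAGTCTCCAAA"
    print(antisense_for_start_region(example, atg_index=11))
```

The returned oligonucleotide is written 5' to 3' and is fully complementary to the selected region; chemical modifications such as those listed above are outside the scope of the sketch.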

To perform antisense therapy, oligonucleotides (RNA, cDNA or DNA) are
designed that are complementary to mRNA encoding an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide. The antisense
oligonucleotides bind to HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) mRNA transcripts and prevent translation. Absolute
complementarity, although preferred, is not required. A sequence "complementary"
to a portion of an RNA, as referred to herein, indicates that a sequence has sufficient
complementarity to be able to hybridize with the RNA, forming a stable duplex; in
the case of double-stranded antisense nucleic acids, a single strand of the duplex
DNA may thus be tested, or triplex formation may be assayed. The ability to
hybridize will depend on both the degree of complementarity and the length of the
antisense nucleic acid, as described in detail above. Generally, the longer the
hybridizing nucleic acid, the more base mismatches with an RNA it may contain and
still form a stable duplex (or triplex, as the case may be). One skilled in the art can
ascertain a tolerable degree of mismatch by use of standard procedures.
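
By way of non-limiting illustration only, the degree of complementarity between an antisense oligonucleotide and its target RNA site can be tallied as sketched below in Python. The sketch counts only Watson-Crick pairs (G-U wobble pairs are treated as mismatches, a simplifying assumption), the function names are hypothetical, and the sketch does not predict duplex stability, which depends on length, sequence and conditions and is ascertained by the standard procedures referred to above.

```python
# Watson-Crick pairs, written as (antisense base, target base) in RNA letters.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}


def _as_rna(seq):
    """Upper-case a sequence and write T as U so DNA oligos can be compared."""
    return seq.upper().replace("T", "U")


def count_mismatches(antisense, target):
    """Count non-pairing positions between an oligo and its target site.

    Both sequences are written 5' to 3' and must have equal length; the
    strands pair antiparallel, so the target is read in reverse.
    """
    antisense, target = _as_rna(antisense), _as_rna(target)
    if len(antisense) != len(target):
        raise ValueError("oligo and target site must have the same length")
    return sum(
        (a, t) not in PAIRS for a, t in zip(antisense, reversed(target))
    )


def fraction_complementary(antisense, target):
    """Fraction of positions that form Watson-Crick pairs."""
    return 1 - count_mismatches(antisense, target) / len(target)


if __name__ == "__main__":
    # Invented six-nucleotide target site and two candidate oligos.
    site = "AUGGCC"
    print(count_mismatches("GGCCAU", site), count_mismatches("GGCCAA", site))
```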

The oligonucleotides used in antisense therapy can be DNA, RNA, or
chimeric mixtures or derivatives or modified versions thereof, single-stranded or
double-stranded. The oligonucleotides can be modified at the base moiety, sugar
moiety, or phosphate backbone, for example, to improve stability of the molecule,
hybridization, etc. The oligonucleotides can include other appended groups such as
peptides (e.g., for targeting host cell receptors in vivo), or compounds facilitating
transport across the cell membrane (see, e.g., Letsinger et al., Proc. Natl. Acad. Sci.
USA 86: 6553-6556 (1989); Lemaitre et al., Proc. Natl. Acad. Sci. USA 84: 648-652
(1987); PCT International Publication No. WO 88/09810) or the blood-brain barrier
(see, e.g., PCT International Publication No. WO 89/10134), or
hybridization-triggered cleavage agents (see, e.g., Krol et al., BioTechniques 6:
958-976 (1988)) or intercalating agents (see, e.g., Zon, Pharm. Res. 5: 539-549
(1988)). To this end, the oligonucleotide may be conjugated to another molecule
(e.g., a peptide, hybridization triggered cross-linking agent, transport agent,
hybridization-triggered cleavage agent).

The antisense molecules are delivered to cells that express HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) in vivo. A number of
methods can be used for delivering antisense DNA or RNA to cells; e.g., antisense
molecules can be injected directly into the tissue site, or modified antisense
molecules, designed to target the desired cells (e.g., antisense linked to peptides or
antibodies that specifically bind receptors or antigens expressed on the target cell
surface) can be administered systemically. Alternatively, in a preferred
embodiment, a recombinant DNA construct is utilized in which the antisense
oligonucleotide is placed under the control of a strong promoter (e.g., pol III or pol
II). The use of such a construct to transfect target cells in the patient results in the
transcription of sufficient amounts of single stranded RNAs that will form
complementary base pairs with the endogenous HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) transcripts and thereby prevent translation of the
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) mRNA.
For example, a vector can be introduced in vivo such that it is taken up by a cell and
directs the transcription of an antisense RNA. Such a vector can remain episomal or
become chromosomally integrated, as long as it can be transcribed to produce the
desired antisense RNA. Such vectors can be constructed by recombinant DNA
technology methods standard in the art and described above. For example, a
plasmid, cosmid, YAC, or viral vector can be used to prepare the recombinant DNA
construct that can be introduced directly into the tissue site. Alternatively, viral
vectors can be used that selectively infect the desired tissue, in which case
administration may be accomplished by another route (e.g., systemically).

Endogenous HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) expression can also be reduced by inactivating or "knocking out"
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid
sequences or their promoters using targeted homologous recombination (e.g., see
Smithies et al., Nature 317: 230-234 (1985); Thomas and Capecchi, Cell 51:
503-512 (1987); Thompson et al., Cell 5: 313-321 (1989)). For example, a mutant,
non-functional HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) (or a completely unrelated DNA sequence) flanked by DNA
homologous to the endogenous HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) (either the coding regions or regulatory regions
of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)) can be
used, with or without a selectable marker and/or a negative selectable marker, to
transfect cells that express HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) in vivo. Insertion of the DNA construct, via targeted homologous
recombination, results in inactivation of HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS). The recombinant DNA constructs can be
directly administered or targeted to the required site in vivo using appropriate
vectors, as described above. Alternatively, expression of non-mutant HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) can be increased
using a similar method: targeted homologous recombination can be used to insert a
DNA construct comprising a non-mutant, functional HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) (e.g., a gene having SEQ ID
NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9, which
may optionally comprise at least one polymorphism), or a portion thereof, in place
of a mutant HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
in the cell, as described above. In another embodiment, targeted homologous
recombination can be used to insert a DNA construct comprising a nucleic acid that
encodes an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide variant that differs from that present in the cell.

Alternatively, endogenous HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) expression can be reduced by targeting
deoxyribonucleotide sequences complementary to the regulatory region of HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) (i.e., the HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) promoter and/or
enhancers) to form triple helical structures that prevent transcription of HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) in target cells in the
body. (See generally, Helene, Anticancer Drug Des., 6(6): 569-84 (1991); Helene et
al., Ann. N.Y. Acad. Sci., 660: 27-36 (1992); and Maher, Bioassays 14(12): 807-15
(1992)). Likewise, the antisense constructs described herein, by antagonizing the
normal biological activity of one of the HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) proteins, can be used in the manipulation of
tissue, e.g., tissue differentiation, both in vivo and for ex vivo tissue cultures.
Furthermore, the antisense techniques (e.g., microinjection of antisense molecules,
or transfection with plasmids whose transcripts are antisense with regard to an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) mRNA or
gene sequence) can be used to investigate the role of HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) in developmental events, as
well as the normal cellular function of HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) in adult tissue. Such techniques can be utilized
in cell culture, but can also be used in the creation of transgenic animals.

In yet another embodiment of the invention, other HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) therapeutic compounds as
described herein can also be used in the treatment or prevention of a cell
proliferation disease, an apoptotic disease, or a cell differentiation disease. The
therapeutic compounds can be delivered in a composition, as described above, or by
themselves. They can be administered systemically, or can be targeted to a
particular tissue. The therapeutic compounds can be produced by a variety of
means, including chemical synthesis; recombinant production; and in vivo production
(e.g., in a transgenic animal, such as in U.S. Patent No. 4,873,316 to Meade et al.), for
example, and can be isolated using standard means such as those described herein.

A combination of any of the above methods of treatment (e.g.,
administration of non-mutant HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide in conjunction with antisense
therapy targeting mutant HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) mRNA; administration of a first variant encoded by HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) in conjunction with
antisense therapy targeting a second variant encoded by HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)) can also be used.

In another embodiment, the invention is directed to HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid molecules and
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptides for use as a medicament in therapy. For example, the nucleic acid
molecules or polypeptides of the present invention can be used in the treatment of a
cell proliferation disease, an apoptotic disease, or a cell differentiation disease. In
addition, the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) nucleic acid molecules and HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptides described herein can be used in
the manufacture of a medicament for the treatment of a cell proliferation disease, an
apoptotic disease, or a cell differentiation disease.

The invention will be further described by the following non-limiting
examples. The teachings of all publications cited herein are incorporated herein by
reference in their entirety.

EXEMPLIFICATION

Cloning of a cDNA encoding a novel HDAC, designated HDAC9

HDAC9 was cloned by PCR and 3' rapid amplification of cDNA ends using
primers designed from the sequence of human chromosome 7 whose translated
product exhibited 80% identity to the HDAC domain of HDAC4, as described in
detail below.

Database analyses indicate that HDRP is located on chromosome 7
(7p15-p21). The human genome database (February 2001 release) of GenBank was
searched using the human HDAC4 amino acid sequence. The TBLASTN program
was used to identify open reading frames downstream of HDRP on chromosome 7
that exhibit significant homology to the HDAC domain of HDAC4. Several
fragments whose translated products exhibit over 58% identity were retrieved. Two
sense primers (OL486, 5'-CCATGGAAACGGTACCCAGCAGGC-3' (SEQ ID NO:
16) and OL487, 5'-CACTCCATCGCTATGATGAAGGG-3' (SEQ ID NO: 17)) and
two antisense primers (OL484, 5'-AGTTCCCTTCATCATAGCGATGG-3' (SEQ ID
NO: 18) and OL485, 5'-AATGTACAGGATGCTGGGGT-3' (SEQ ID NO: 19))
were designed based upon one of these fragments, the translated product of which
matched amino acids 842-873 of HDAC4. RT-PCR was performed using each of
the antisense primers and a sense primer
(5'-CCCTTGTAGCTGGTGGAGTTCCCTT-3' (SEQ ID NO: 20)) from the coding
region of HDRP, with human brain cDNA as a template. PCR was performed in a
Biometra TGRADIENT Thermocycler for 30 cycles of 95°C for 20 seconds, 60°C
for 20 seconds, and 72°C for 120 seconds.
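
The relationship between the primers listed above can be checked with a few lines of code. The following Python sketch is an illustration only and is not part of the methods described here. It reverse-complements antisense primer OL484 and looks for the stretch it shares with nested sense primer OL487, which would be expected if both primers derive from the same genomic fragment. The primer sequences are transcribed from the text above; the helper names `reverse_complement` and `longest_common_substring` are hypothetical.

```python
# Illustrative primer check; sequences are transcribed from the text above.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

PRIMERS = {
    "OL486": "CCATGGAAACGGTACCCAGCAGGC",  # sense, SEQ ID NO: 16
    "OL487": "CACTCCATCGCTATGATGAAGGG",   # sense, SEQ ID NO: 17
    "OL484": "AGTTCCCTTCATCATAGCGATGG",   # antisense, SEQ ID NO: 18
    "OL485": "AATGTACAGGATGCTGGGGT",      # antisense, SEQ ID NO: 19
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def longest_common_substring(a: str, b: str) -> str:
    """Return the longest stretch of `a` that also occurs in `b` (brute force)."""
    best = ""
    for i in range(len(a)):
        # Only try candidates longer than the best match found so far.
        for j in range(i + len(best) + 1, len(a) + 1):
            if a[i:j] in b:
                best = a[i:j]
            else:
                break
    return best


ol484_rc = reverse_complement(PRIMERS["OL484"])
shared = longest_common_substring(PRIMERS["OL487"], ol484_rc)
print("OL484 reverse complement:", ol484_rc)
print("Shared with OL487:", shared, f"({len(shared)} nt)")
```

With the sequences as transcribed, the two primers share a 19-nucleotide stretch on the sense strand, consistent with both having been designed from one fragment.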

3'-rapid amplification of cDNA ends was performed using the sense primer
OL486 and adaptor primer 1 (Clontech), and Marathon-Ready cDNA from human
brain (Clontech, Palo Alto, CA) according to the manufacturer's instructions. The
products were re-amplified using nested sense primer OL487 and adaptor primer 2
(Clontech, Palo Alto, CA). PCR products were cloned into pGEM-T-easy vector
(Promega, Madison, WI) and sequenced using an automated DNA sequencer at the
DNA Sequencing Core Facility of the Memorial Sloan-Kettering Cancer Center,
using DNA sequencing methods known to one of skill in the art.

Two cDNAs were cloned using the above-described methods. One cDNA
(SEQ ID NO: 1) encodes an HDAC9 protein that is 1011 amino acids in length. The
other cDNA (SEQ ID NO: 3) encodes an HDAC9a protein that is 879 amino acids
long. The cDNA sequences and amino acid sequences of HDAC9 and HDAC9a are
shown in FIGS. 1A-1G and FIGS. 2A-2B, respectively. Database analyses of these
cDNAs against human genomic DNA sequences indicated that these two cDNAs are
generated by alternative splicing. An alignment of HDAC9, HDAC9a, HDRP,
and HDAC4 is shown in FIGS. 3A-3C.

Each of the HDAC9 and HDAC9a nucleic acid sequences was cloned into
the pFLAG-CMV-5b vector (Sigma) in frame with the C-terminal FLAG tag. Only
the coding regions plus three extra base pairs (ACC) of cDNA of the HDAC9 and
HDAC9a nucleic acid sequences were included in the constructs. These constructs
are referred to herein as HDAC9-FLAG and HDAC9a-FLAG, respectively. These
constructs are contained in E. coli, and can readily be expressed. For HDAC9, the
insert size is 3033 bp, and for HDAC9a, the insert size is 2637 bp. Both HDAC9 and
HDAC9a can be released by digestion with the restriction enzymes EcoRV and
BamHI (whose sites were incorporated in the primers used to obtain the HDAC9
and HDAC9a coding cDNA for cloning purposes).
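
The stated insert sizes can be reconciled with the protein lengths given above by simple arithmetic. The short Python sketch below is an illustration under one assumption that the text does not state explicitly: that each insert size counts only the sense codons of the coding region (no stop codon, since the C-terminal FLAG tag is fused in frame) and does not count the three extra base pairs. The function name `coding_length_bp` is hypothetical.

```python
# Protein lengths and insert sizes as stated in the text.
PROTEIN_LENGTHS_AA = {"HDAC9": 1011, "HDAC9a": 879}
STATED_INSERT_BP = {"HDAC9": 3033, "HDAC9a": 2637}


def coding_length_bp(n_residues: int) -> int:
    """Length of the coding region in base pairs, three per residue, no stop codon."""
    return 3 * n_residues


for name, residues in PROTEIN_LENGTHS_AA.items():
    computed = coding_length_bp(residues)
    status = "matches" if computed == STATED_INSERT_BP[name] else "differs from"
    print(f"{name}: {residues} aa x 3 = {computed} bp, {status} the stated insert size")
```

Under that assumption, both stated insert sizes equal three times the corresponding protein length.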

The HDAC9 cDNA sequences, from the known 5'-end of HDRP cDNA to the
3'-untranslated region cloned in this study, cover over 511 kb of genomic DNA on
chromosome 7. As shown in FIG. 4, the coding region cDNA of HDAC9 resides in
23 exons spanning 458 kb of genomic sequence. Exons 21, 22, and 23 form one
single exon in HDAC9a, but the middle exon, which is numbered exon 22 in FIG. 4
and contains an in-frame stop codon, is spliced out in HDAC9. In addition, exons 12
and 13 form a single exon used by HDRP. Exon 13 is spliced out as part of an
intron in HDAC9 and HDAC9a.

Further analysis revealed that exon 7, which contains a nuclear localization
signal (NLS), is alternatively spliced in an HDRP isoform, creating HDRP(ANLS).
RT-PCR analyses using primers based on sequences from exon 6 and exon 14
indicate that this alternative splicing event also occurs in HDAC9 and/or HDAC9a.
Thus, it is possible that at least 6 proteins can be generated from a single HDAC9
gene by alternative splicing of its RNA. The cDNA sequences and amino acid
sequences for HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), and
HDRP(ANLS) are shown in FIGS. 1A-1O and 2A-2E, respectively.
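
One way to read the count of "at least 6" proteins is as three splice backbones (HDAC9, HDAC9a, and HDRP), each with or without the NLS-containing exon 7. The text does not enumerate the six products, so the Python sketch below is an interpretation and not a statement from the original; the function name `isoform_names` is hypothetical, and the "(ANLS)" suffix follows the naming used in this text.

```python
from itertools import product

# Three backbones that differ in their 3' splicing, as described in the text.
BACKBONES = ("HDAC9", "HDAC9a", "HDRP")


def isoform_names() -> list[str]:
    """Combine each backbone with inclusion or exclusion of the NLS exon (exon 7)."""
    names = []
    for backbone, has_nls_exon in product(BACKBONES, (True, False)):
        names.append(backbone if has_nls_exon else f"{backbone}(ANLS)")
    return names


names = isoform_names()
print(len(names), "candidate products:", ", ".join(names))
```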


HDAC9 mRNA is differentially expressed among human tissues 

The expression of HDAC9 mRNA was determined by Northern blot analysis
using a human multiple tissue Northern blot (Clontech, Palo Alto, CA).
Hybridization was performed according to the manufacturer's instructions using
ExPressHyb solution (Clontech, Palo Alto, CA). The 3'-untranslated region
common to both HDAC9 and HDAC9a, which shares no significant sequence
homology with HDRP, was labeled with 32P by random priming and used as a
probe. Two transcripts at 9.8 and 4.1 kb were detected in all tissues examined
(FIG. 6A). The 4.1 kb transcript is shorter than the 4.4 kb HDRP transcript (See
Zhou et al., Proc. Natl. Acad. Sci. USA, 97:1056-1061 (2000)). A third transcript at
1.2 kb was detected in placenta (FIG. 6A). Similar to HDRP (See Zhou, X., et al.,
Proc. Natl. Acad. Sci. USA, 97:1056-1061 (2000)), high levels of HDAC9
transcripts were detected in brain and skeletal muscle (FIG. 6A).

The distribution of alternatively spliced mRNA variants among tissues was
examined by RT-PCR using primers (OL516, 5'-TGTGTCATCGAGCTGGCTTC-3'
(SEQ ID NO: 21) and OL517, 5'-ATCTTCTGCAAGTGGCTCCA-3' (SEQ ID NO:
22)) spanning the alternatively spliced exon 22 and a cDNA panel from the same
tissues as the multiple tissue Northern blot. PCR was performed in a Biometra
TGRADIENT Thermocycler for 30 cycles of 95°C for 20 seconds, 60°C for 20
seconds, and 72°C for 60 seconds. The expected sizes of PCR products were 680
base pairs for HDAC9 and 993 base pairs for HDAC9a. The ratio of HDAC9 to
HDAC9a transcripts differed among tissues (FIG. 6B). In the placenta and kidney,
the levels of the two transcripts were about the same (FIG. 6B). In the brain, heart,
and pancreas, there were more transcripts of HDAC9 than HDAC9a. In the other
tissues examined, there were more HDAC9a transcripts than HDAC9 transcripts
(FIG. 6B). Under the conditions tested, HDAC9 transcripts were undetectable in
liver (FIG. 6B). The lung had an abundant HDAC9 product that was larger than
expected. The lung also had low levels of HDAC9 transcripts and HDAC9a
transcripts (FIG. 6B). An additional PCR product was also amplified from cDNA of
the pancreas; this product differed in size from the expected products from HDAC9
and HDAC9a (FIG. 6B). The identity of the different sized transcripts is unknown.
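
The expected product sizes above lend themselves to a simple band-calling rule. The Python sketch below is illustrative only: the tolerance of 25 bp and the function name `call_isoform` are hypothetical choices, not values from this text. It also computes the 313 bp difference between the two expected products, which presumably corresponds to the retained sequence numbered exon 22 in FIG. 4, although the text does not state that size directly.

```python
# Expected RT-PCR product sizes (base pairs) as stated in the text.
EXPECTED_BP = {"HDAC9": 680, "HDAC9a": 993}


def call_isoform(product_bp: int, tolerance_bp: int = 25) -> str:
    """Assign an observed product size to an isoform, or flag it as unexpected.

    The tolerance is a hypothetical allowance for gel sizing error.
    """
    for isoform, expected in EXPECTED_BP.items():
        if abs(product_bp - expected) <= tolerance_bp:
            return isoform
    return "unexpected"


size_difference = EXPECTED_BP["HDAC9a"] - EXPECTED_BP["HDAC9"]
print("Size difference between expected products:", size_difference, "bp")
for observed in (680, 993, 1500):
    print(observed, "bp ->", call_isoform(observed))
```

Products flagged as unexpected would correspond to bands such as the additional lung and pancreas products noted above, whose identity is unknown.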


HDAC9 and HDAC9a possess histone deacetylase activity 

HDAC9 was named based on sequence homology to HDAC4 (FIGS. 3A-3C).
To determine whether HDAC9 and HDAC9a possess HDAC activity, an
HDAC enzymatic assay was performed using anti-FLAG immunoprecipitated
HDAC9-FLAG and HDAC9a-FLAG.

C-terminal FLAG-tagged HDAC9 (HDAC9-FLAG) and HDAC9a
(HDAC9a-FLAG) expression vectors were constructed using the pFLAG-CMV-5b
vector (Sigma) and PCR-amplified coding regions of HDAC9 and HDAC9a in
frame with the FLAG tag to form pFLAG-CMV-5b-HDAC9 (plasmid VR1) and
pFLAG-CMV-5b-HDAC9a (plasmid VR2). All constructs were confirmed by DNA
sequencing.

Transfection of human kidney 293T cells, immunoprecipitation using
anti-FLAG M2 Agarose (Sigma), Western blot analyses and dual luciferase assays
were performed essentially as previously described by Zhou et al. (Proc. Natl. Acad.
Sci. USA 97:1056-1061 (2000)). Briefly, the cells (American Type Culture
Collection) were cultured in DME HG medium (GIBCO/BRL) supplemented with 10%
(vol/vol) FBS at 37°C in a 5% CO2 atmosphere. Transient transfection was
performed by using Lipofectamine (GIBCO/BRL) or Fugene 6 (Roche Molecular
Biochemicals) according to the manufacturers' instructions. Cells were harvested
24 to 48 hours after transfection and lysed in TP lysis buffer (50 mM Tris HCl, pH
7.5/120 mM NaCl/5 mM EDTA/0.5% NP-40) at 5 x 10^7 cells per ml.
Immunoprecipitation with anti-FLAG M2-agarose (Sigma, St. Louis, MO) was
performed according to the manufacturer's instructions. Immunoprecipitated
proteins were released from the agarose beads by using FLAG-peptide and either
used directly for HDAC enzymatic activity assays or resolved on SDS/PAGE for
Western blot analyses. Anti-FLAG antibody was purchased from Sigma (St. Louis,
MO). Western blot analyses were performed using standard methods.

HDAC9 and HDAC9a enzymatic activity was assessed with the HDAC
Fluorescent Activity Assay/Drug Discovery Kit-AK-500 (BIOMOL Research
Laboratories), using the FLUOR DE LYS™ substrate, which contains an acetylated
lysine side chain, and immunoprecipitated HDAC9-FLAG and HDAC9a-FLAG
polypeptides according to the manufacturer's instructions. Fluorescence was read
on a SPECTRAmax® GEMINI XS microplate spectrofluorometer using the
SOFTmax® PRO system (Molecular Devices) at excitation 355 nm and emission
460 nm with a cutoff filter of 455 nm. Briefly, HDAC9-FLAG and HDAC9a-FLAG
were incubated with the substrate overnight at room temperature in a 96-well plate.
The reaction was stopped by addition of Fluor De Lys™ Developer and samples
were read with the fluorometer.



As shown in FIG. 7, both HDAC9-FLAG and HDAC9a-FLAG deacetylated
the acetylated lysine of FLUOR DE LYS™, and the activity of HDAC9 and
HDAC9a was comparable. To further examine the activity of HDAC9 and HDAC9a,
inhibition studies using TSA were carried out by preincubating HDAC9-FLAG and
HDAC9a-FLAG with TSA for 15 minutes at room temperature. The assay was then
carried out as stated above. As shown in FIG. 7, TSA inhibited HDAC9 and
HDAC9a deacetylase activity. The inset gel in FIG. 7 shows the amount of protein
used in the assay. SAHA, a potent HDAC inhibitor (Richon et al., Proc. Natl. Acad.
Sci. USA, 95:3003-3007 (1998)), also completely inhibited the histone deacetylase
activity of HDAC9-FLAG and HDAC9a-FLAG. The HDAC activity of HDAC9
and HDAC9a was about ten times lower than the deacetylase activity of HDAC4
when comparable amounts of protein were used under the conditions tested here.

HDAC9 and HDAC9a enzymatic activity was also determined through
HDAC enzymatic assays using 3H-histones isolated from murine erythroleukemia
cells as a substrate. This assay was performed essentially as described by Richon et
al. (Proc. Natl. Acad. Sci. USA, 95:3003-3007 (1998)). Briefly, HDAC9-FLAG
and HDAC9a-FLAG were incubated with 3H-histones overnight at 37°C. The
reaction was stopped by the addition of 1M HCl/0.1 acetic acid. Released 3H-acetic
acid was extracted with ethyl acetate and quantified by scintillation counting. For
inhibition studies, the immunoprecipitated complexes were preincubated with the
different HDAC inhibitors for 30 minutes at 4°C.

As shown in FIG. 8, HDAC9a-FLAG deacetylated 3H-acetyl-histones.
SAHA, a potent HDAC inhibitor, also completely inhibited the histone deacetylase
activity of HDAC9a-FLAG. TSA also inhibited HDAC9a deacetylase activity.
Similar results were obtained when HDAC9 was used as the enzyme source.

HDAC9 and HDAC9a repress MEF2-mediated transcription

The Xenopus homolog of HDRP, MITR, was identified as a MEF2
interacting transcriptional repressor (Sparrow et al., EMBO J. 18:5085-5098 (1999)),
and mouse HDRP also interacts with MEF2 and represses MEF2-mediated
transcription (Zhang et al., J. Biol. Chem. 276:35-39 (2001)). We first tested whether
HDAC9-FLAG and HDAC9a-FLAG interact with MEF2. 293 cells were transfected
with vector, HDAC9-FLAG, or HDAC9a-FLAG. The cells were subsequently lysed
and HDAC9-FLAG and HDAC9a-FLAG proteins were immunoprecipitated with
anti-FLAG antibodies. Western blot analysis of the immunoprecipitated proteins was
carried out, using anti-MEF-2 antibody to probe the blot. As shown in FIG. 9A,
both HDAC9 and HDAC9a interacted with MEF2 in 293T cells.

It was then determined whether HDAC9 and HDAC9a repress
MEF2-mediated transcription. This determination was carried out as follows. The
p3XMEF2-luciferase reporter gene (100 ng) and the vector pRL-TK (Promega) (5
ng) were co-transfected into 293T cells in the absence (pcDNA3 empty vector) or
presence of MEF2C (100 ng of pCMV-MEF2C). HDAC9-F (1 ng, 10 ng, or 100 ng
of pFLAG-HDAC9; pFLAG-HDAC9 and HDAC9-FLAG are different constructs,
with the FLAG sequence located at opposite ends of the HDAC9 nucleotide
sequence, but are functionally equivalent) or HDAC9a-F (1 ng, 10 ng, or 100 ng of
pFLAG-HDAC9a; pFLAG-HDAC9a and HDAC9a-FLAG are different constructs,
with the FLAG sequence located at opposite ends of the HDAC9a nucleotide
sequence, but are functionally equivalent) was included in a subset of experimental
groups with the MEF2C vector. pFLAG empty vector was used to adjust the DNA
to an equal amount in each transfection. The cells were harvested 24 to 36 hours
after transfection and the luciferase activities were measured using the
Dual-Luciferase™ Reporter Assay System from Promega according to the
manufacturer's instructions. The firefly luciferase activity was first normalized to
the co-transfected Renilla luciferase activity (encoded by the pRL-TK vector), and
the luciferase activity value for cells transfected with MEF2C alone was set at 1.
MEF2C activated transcription to over 30 times the basal level of transcription. As
shown in FIG. 9B, HDAC9-FLAG and HDAC9a-FLAG repressed MEF2C-mediated
transcriptional activation in a dose-dependent manner and completely abolished the
activation at the 100 ng dose for both HDAC9 and HDAC9a. The repression of
MEF2C-mediated transcription by HDAC9 and HDAC9a was specific, since the
co-transfected reporter gene used to control for transfection efficiency, which
contains a TK promoter, was not repressed by HDAC9 or HDAC9a.
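
The two-step normalization described above (firefly signal divided by Renilla signal, then scaled so that the MEF2C-alone group equals 1) can be sketched as follows. This Python example is illustrative only: the raw readings are hypothetical numbers chosen so that MEF2C alone sits at 30 times the vector control, and they are not data from FIG. 9B. The function name `relative_luciferase` and the group labels are likewise assumptions.

```python
def relative_luciferase(firefly: dict, renilla: dict, reference: str) -> dict:
    """Normalize firefly to Renilla per group, then scale the reference group to 1."""
    ratios = {group: firefly[group] / renilla[group] for group in firefly}
    reference_ratio = ratios[reference]
    return {group: ratio / reference_ratio for group, ratio in ratios.items()}


# Hypothetical raw luminometer readings (arbitrary units), for illustration only.
firefly = {"vector": 100.0, "MEF2C": 3000.0, "MEF2C + HDAC9 (100 ng)": 120.0}
renilla = {"vector": 50.0, "MEF2C": 50.0, "MEF2C + HDAC9 (100 ng)": 60.0}

relative = relative_luciferase(firefly, renilla, reference="MEF2C")
for group, value in relative.items():
    print(f"{group}: {value:.3f}")
```

Dividing by the Renilla signal first corrects for differences in transfection efficiency between wells; scaling to the MEF2C-alone group then expresses repression as a fraction of full MEF2C-driven activation.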

Described herein is the identification and characterization of a new class II
HDAC, designated HDAC9. HDAC9 has several alternatively spliced isoforms,
one of which is the previously identified HDRP (Zhou et al., Proc. Natl. Acad. Sci.
USA 97:1056-1061 (2000)). HDAC9 and HDAC9a possess HDAC activity,
although their specific enzymatic activity appears to be lower than that of HDAC4.
While not wishing to be bound by any particular theory, it is possible that an
essential co-factor is lost during immunoprecipitation or does not exist in 293T cells
(for example, metastasis-associated protein 2 is essential for the assembly of a
catalytically active HDAC1 (Zhang et al., Genes Dev. 13:1924-1935 (1999))), that
the substrates used are not the natural substrates of the enzymes, or that the FLAG
tag interferes with the folding of the protein.

Searching the human genome with the HDAC domain from either HDAC1
or HDAC9 identified a total of 10 HDACs in the presently completed human
genome sequence, a number of which are schematically represented in FIG. 10.
HDACs 1, 2, 3, 8, 4, 5, 6, 7, 9, and 9a all have HDAC domains. HDRP, which is
also schematically depicted in FIG. 10, does not have a catalytic domain.

All references described herein are incorporated by reference in their
entirety. While this invention has been particularly shown and described with
reference to preferred embodiments thereof, it will be understood by those skilled in
the art that various changes in form and details may be made therein without
departing from the spirit and scope of the invention as defined by the appended
claims.

CLAIMS

What is claimed is: 

1. An isolated or recombinant histone deacetylase polypeptide, said polypeptide 
selected from: 

a) an isolated or recombinant polypeptide comprising SEQ ID NO: 2, 
SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10; 
and 

b) an isolated or recombinant polypeptide having at least 60% sequence 
identity with any one of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 
6, SEQ ID NO: 8, or SEQ ID NO: 10. 

2. The isolated or recombinant histone deacetylase polypeptide of Claim 1, said
polypeptide selected from:

a) a polypeptide consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID
NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10.

3. The isolated or recombinant histone deacetylase polypeptide of Claim 1,
wherein said polypeptide is human.

4. An isolated nucleic acid molecule selected from the group:

a) an isolated nucleic acid comprising SEQ ID NO: 1 , SEQ ID NO: 3, 
SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9; 

b) a complement of an isolated nucleic acid comprising SEQ ID NO: 1,
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9;

c) an isolated nucleic acid encoding a histone deacetylase polypeptide
of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or
SEQ ID NO: 10;

d) a complement of an isolated nucleic acid encoding a histone
deacetylase polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID
NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10;

e) a nucleic acid that is hybridizeable under high stringency conditions
to a nucleic acid molecule that encodes any of SEQ ID NO: 2, SEQ
ID NO: 4, SEQ ID NO: 6, or SEQ ID NO: 8, or a complement
thereof; or

f) a nucleic acid molecule that is hybridizeable under high stringency
conditions to a nucleic acid comprising SEQ ID NO: 1, SEQ ID NO:
3, SEQ ID NO: 5, or SEQ ID NO: 7; and

g) an isolated nucleic acid molecule that has at least 55% sequence
identity with any one of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO:
5, SEQ ID NO: 7, SEQ ID NO: 9, or a complement thereof.

5. The isolated nucleic acid molecule of Claim 4, said nucleic acid molecule
consisting of the nucleic acid molecule selected from the group consisting of
SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, and SEQ ID
NO: 9.

6. The isolated nucleic acid molecule of Claim 4, wherein said nucleic acid
molecule is human.

7. A vector comprising the isolated nucleic acid molecule of Claim 4.

8. A cell comprising the vector of Claim 7.

9. A cell comprising the isolated nucleic acid molecule of Claim 4.

10. A purified antibody that selectively binds a polypeptide of Claim 1.

11. A method of identifying a compound that modulates expression of a nucleic
acid molecule of Claim 4, said method comprising the steps of:

a) contacting said nucleic acid molecule with a candidate compound
under conditions suitable for expression; and

b) assessing the level of expression of said nucleic acid molecule,

wherein a candidate compound that increases or decreases expression of said
nucleic acid molecule relative to a control is a compound that modulates
expression of said nucleic acid molecule.

12. The method of Claim 11, wherein said method is carried out in a cell or
animal.

13. The method of Claim 11, wherein said method is carried out in a cell free
system.

14. A method of identifying a compound that modulates the enzymatic activity
of the polypeptide of Claim 1, said method comprising the steps of:

a) contacting said polypeptide with a candidate compound under
conditions suitable for enzymatic reaction; and

b) assessing the enzymatic activity level of said polypeptide,

wherein a candidate compound that increases or decreases the enzymatic
activity level of said polypeptide relative to a control is a compound that
modulates the enzymatic activity of said polypeptide.

15. The method of Claim 14, wherein said method is carried out in a cell or
animal.

16. The method of Claim 14, wherein said method is carried out in a cell free
system.



17. The method of Claim 14, wherein said polypeptide is further contacted with
a substrate for the polypeptide, and wherein said substrate is selected from
the group consisting of a cell proliferation disease binding agent, an
apoptotic disease binding agent, and a cell differentiation disease binding
agent.

18. The method of Claim 17, wherein said candidate compound is an inhibitor.

19. The method of Claim 17, wherein said candidate compound is an activator.

20. A method of identifying a compound that modulates the transcriptional
repression activity of the polypeptide of Claim 1, said method comprising
the steps of:

a) contacting said polypeptide with a candidate compound under
conditions suitable for a transcriptional repression reaction; and

b) assessing the transcriptional repression activity level of said
polypeptide,

wherein a candidate compound that increases or decreases the transcriptional
repression activity level of said polypeptide relative to a control is a
compound that modulates the transcriptional repression activity of said
polypeptide.

21. The method of Claim 20, wherein said method is carried out in a cell or
animal.

22. The method of Claim 20, wherein said method is carried out in a cell free
system.

23. The method of Claim 20, wherein said polypeptide is further contacted with
a substrate for the polypeptide, and wherein said substrate is selected from
the group consisting of a cell proliferation disease binding agent, an
apoptotic disease binding agent, and a cell differentiation disease binding
agent.

24. The method of Claim 23, wherein said candidate compound is an inhibitor.

25. The method of Claim 23, wherein said candidate compound is an activator.

26. A method of identifying a compound that modulates expression of a nucleic
acid molecule of Claim 4, said method comprising the steps of:

a) providing a nucleic acid molecule comprising a promoter region of
said nucleic acid of Claim 4 or part of a promoter region of said
nucleic acid of Claim 4 operably linked to a reporter gene;

b) contacting said nucleic acid molecule with a candidate compound;
and

c) assessing the level of said reporter gene,

wherein a candidate compound that increases or decreases expression of said
reporter gene relative to a control is a compound that modulates expression
of said nucleic acid molecule of Claim 4.

27. The method of Claim 26, wherein said method is carried out in a cell.



28. A method of identifying a polypeptide that interacts with a polypeptide of
Claim 1 in a yeast two-hybrid system, said method comprising the steps of:

a) providing a first nucleic acid vector comprising a nucleic acid
molecule encoding a DNA binding domain and said polypeptide of
Claim 1;

b) providing a second nucleic acid vector comprising a nucleic acid
encoding a transcription activation domain and a nucleic acid
encoding a test polypeptide;

c) contacting said first nucleic acid vector with said second nucleic acid
vector in a yeast two-hybrid system; and

d) assessing transcriptional activation in said yeast two-hybrid system,

wherein an increase in transcriptional activation relative to a control
indicates that the test polypeptide is a polypeptide that interacts with said
polypeptide of Claim 1.

29. A pharmaceutical composition comprising a polypeptide of Claim 1.

30. A method of diagnosing a cell proliferation disease, an apoptotic disease, or
a cell differentiation disease in a subject, said method comprising the steps
of:

a) obtaining a sample from said subject; and

b) assessing the level of activity or expression of said polypeptide of
Claim 1 in said sample, or detecting the level of said nucleic acid
molecule of Claim 4,

wherein if said level is increased relative to a control, then said subject has
an increased likelihood of having a cell proliferation disease, an apoptotic
disease, or a cell differentiation disease, and wherein if said level is
decreased relative to a control, then said subject has a decreased likelihood
of having a cell proliferation disease, an apoptotic disease, or a cell
differentiation disease.

31. The method of Claim 30, wherein said level of activity or expression of said
polypeptide of Claim 1 in said sample is measured using
immunohistochemical techniques.

32. The method of Claim 30, wherein said level of said nucleic acid molecule of
Claim 4 in said sample is measured using in situ hybridization techniques.

33. A method of treating a cell proliferation disease, an apoptotic disease, or a
cell differentiation disease, said method comprising administering a
compound identified by the method of Claim 14.

34. A method of treating a cell proliferation disease, an apoptotic disease, or a
cell differentiation disease, said method comprising administering a
compound identified by the method of Claim 20.


SEQUENCE LISTING

<110> Sloan-Kettering Institute for Cancer Research 
Richon, Victoria 
Zhou, Xianbo
Rifkind, Richard A. 
Marks, Paul A. 

<120> HDAC9 Polypeptides and Polynucleotides 
and Uses Thereof 

<130> 3254.1000005 

<150> 60/298,173 
<151> 2001-06-14 

<150> 60/311,686 
<151> 2001-08-10 

<150> 60/316,995 
<151> 2001-09-04 

<160> 22 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 3186 
<212> DNA 

<213> Homo sapiens 
<400> 1 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60 
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120 
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180 
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240 
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300 
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360 
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420 
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480 
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540 
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600 
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660 
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720 
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780 
gatgatttcc cccttcgaaa aactgcctct gagcccaact tgaaggtgcg gtccaggtta 840 
aaacagaaag tggcagagag gagaagcagc cccttactca ggcggaagga tggaaatgtt 900 
gtcacttcat tcaagaagcg aatgtttgag gtgacagaat cctcagtcag tagcagttct 960 
ccaggctctg gtcccagttc accaaacaat gggccaactg gaagtgttac tgaaaatgag 1020 
acttcggttt tgccccctac ccctcatgcc gagcaaatgg tttcacagca acgcattcta 1080 
attcatgaag attccatgaa cctgctaagt ctttatacct ctccttcttt gcccaacatt 1140 
accttggggc ttcccgcagt gccatcccag ctcaatgctt cgaattcact caaagaaaag 1200 
cagaagtgtg agacgcagac gcttaggcaa ggtgttcctc tgcctgggca gtatggaggc 1260 
agcatcccgg catcttccag ccaccctcat gttactttag agggaaagcc acccaacagc 1320 
agccaccagg ctctcctgca gcatttatta ttgaaagaac aaatgcgaca gcaaaagctt 1380 
cttgtagctg gtggagttcc cttacatcct cagtctccct tggcaacaaa agagagaatt 1440 
tcacctggca ttagaggtac ccacaaattg ccccgtcaca gacccctgaa ccgaacccag 1500 
tctgcacctt tgcctcagag cacgttggct cagctggtca ttcaacagca acaccagcaa 1560 
ttcttggaga agcagaagca ataccagcag cagatccaca tgaacaaact gctttcgaaa 1620 
tctattgaac aactgaagca accaggcagt caccttgagg aagcagagga agagcttcag 1680 
ggggaccagg cgatgcagga agacagagcg ccctctagtg gcaacagcac taggagcgac 1740
agcagtgctt gtgtggatga cacactggga caagttgggg ctgtgaaggt caaggaggaa 1800
ccagtggaca gtgatgaaga tgctcagatc caggaaatgg aatctgggga gcaggctgct 1860
tttatgcaac agcctttcct ggaacccacg cacacacgtg cgctctctgt gcgccaagct 1920
ccgctggctg cggttggcat ggatggatta gagaaacacc gtctcgtctc caggactcac 1980
tcttcccctg ctgcctctgt tttacctcac ccagcaatgg accgccccct ccagcctggc 2040
tctgcaactg gaattgccta tgaccccttg atgctgaaac accagtgcgt ttgtggcaat 2100
tccaccaccc accctgagca tgctggacga atacagagta tctggtcacg actgcaagaa 2160
actgggctgc taaataaatg tgagcgaatt caaggtcgaa aagccagcct ggaggaaata 2220
cagcttgttc attctgaaca tcactcactg ttgtatggca ccaaccccct ggacggacag 2280
aagctggacc ccaggatact cctaggtgat gactctcaaa agtttttttc ctcattacct 2340
tgtggtggac ttggggtgga cagtgacacc atttggaatg agctacactc gtccggtgct 2400
gcacgcatgg ctgttggctg tgtcatcgag ctggcttcca aagtggcctc aggagagctg 2460
aagaatgggt ttgctgttgt gaggccccct ggccatcacg ctgaagaatc cacagccatg 2520
gggttctgct tttttaattc agttgcaatt accgccaaat acttgagaga ccaactaaat 2580
ataagcaaga tattgattgt agatctggat gttcaccatg gaaacggtac ccagcaggcc 2640
ttttatgctg accccagcat cctgtacatt tcactccatc gctatgatga agggaacttt 2700
ttccctggca gtggagcccc aaatgaggtt ggaacaggcc ttggagaagg gtacaatata 2760
aatattgcct ggacaggtgg ccttgatcct cccatgggag atgttgagta ccttgaagca 2820
ttcaggacca tcgtgaagcc tgtggccaaa gagtttgatc cagacatggt cttagtatct 2880
gctggatttg atgcattgga aggccacacc cctcctctag gagggtacaa agtgacggca 2940
aaatgttttg gtcatttgac gaagcaattg atgacattgg ctgatggacg tgtggtgttg 3000
gctctagaag gaggacatga tctcacagcc atctgtgatg catcagaagc ctgtgtaaat 3060
gcccttctag gaaatgagct ggagccactt gcagaagata ttctccacca aagcccgaat 3120
atgaatgctg ttatttcttt acagaagatc attgaaattc aaagtatgtc tttaaagttc 3180
tcttaa 3186

<210> 2
<211> 1011
<212> PRT
<213> Homo sapiens
<400> 2

Met His Ser Met Ile Ser Ser Val Asp Val Lys Ser Glu Val Pro Val
1 5 10 15

Gly Leu Glu Pro Ile Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met
20 25 30

Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gln Leu Gln Gln
35 40 45

Glu Leu Leu Leu Ile Gln Gln Gln Gln Gln Ile Gln Lys Gln Leu Leu
50 55 60

Ile Ala Glu Phe Gln Lys Gln His Glu Asn Leu Thr Arg Gln His Gln
65 70 75 80

Ala Gln Leu Gln Glu His Ile Lys Glu Leu Leu Ala Ile Lys Gln Gln
85 90 95

Gln Glu Leu Leu Glu Lys Glu Gln Lys Leu Glu Gln Gln Arg Gln Glu
100 105 110

Gln Glu Val Glu Arg His Arg Arg Glu Gln Gln Leu Pro Pro Leu Arg
115 120 125

Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lys
130 135 140

Gln Lys Leu Gln Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr
145 150 155 160

Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp
165 170 175

Tyr Thr Ala Ala His His Thr Ser Leu Asp Gln Ser Ser Pro Pro Leu
180 185 190

Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly Ala Gln Asp
195 200 205

Ala Lys Asp Asp Phe Pro Leu Arg Lys Thr Ala Ser Glu Pro Asn Leu
210 215 220

Lys Val Arg Ser Arg Leu Lys Gln Lys Val Ala Glu Arg Arg Ser Ser
225 230 235 240

Pro Leu Leu Arg Arg Lys Asp Gly Asn Val Val Thr Ser Phe Lys Lys
245 250 255

Arg Met Phe Glu Val Thr Glu Ser Ser Val Ser Ser Ser Ser Pro Gly
260 265 270

Ser Gly Pro Ser Ser Pro Asn Asn Gly Pro Thr Gly Ser Val Thr Glu
275 280 285

Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala Glu Gln Met Val
290 295 300

Ser Gln Gln Arg Ile Leu Ile His Glu Asp Ser Met Asn Leu Leu Ser
305 310 315 320

Leu Tyr Thr Ser Pro Ser Leu Pro Asn Ile Thr Leu Gly Leu Pro Ala
325 330 335

Val Pro Ser Gln Leu Asn Ala Ser Asn Ser Leu Lys Glu Lys Gln Lys
340 345 350

Cys Glu Thr Gln Thr Leu Arg Gln Gly Val Pro Leu Pro Gly Gln Tyr
355 360 365

Gly Gly Ser Ile Pro Ala Ser Ser Ser His Pro His Val Thr Leu Glu
370 375 380

Gly Lys Pro Pro Asn Ser Ser His Gln Ala Leu Leu Gln His Leu Leu
385 390 395 400

Leu Lys Glu Gln Met Arg Gln Gln Lys Leu Leu Val Ala Gly Gly Val
405 410 415

Pro Leu His Pro Gln Ser Pro Leu Ala Thr Lys Glu Arg Ile Ser Pro
420 425 430

Gly Ile Arg Gly Thr His Lys Leu Pro Arg His Arg Pro Leu Asn Arg
435 440 445

Thr Gln Ser Ala Pro Leu Pro Gln Ser Thr Leu Ala Gln Leu Val Ile
450 455 460

Gln Gln Gln His Gln Gln Phe Leu Glu Lys Gln Lys Gln Tyr Gln Gln
465 470 475 480

Gln Ile His Met Asn Lys Leu Leu Ser Lys Ser Ile Glu Gln Leu Lys
485 490 495

Gln Pro Gly Ser His Leu Glu Glu Ala Glu Glu Glu Leu Gln Gly Asp
500 505 510

Gln Ala Met Gln Glu Asp Arg Ala Pro Ser Ser Gly Asn Ser Thr Arg
515 520 525

Ser Asp Ser Ser Ala Cys Val Asp Asp Thr Leu Gly Gln Val Gly Ala
530 535 540

Val Lys Val Lys Glu Glu Pro Val Asp Ser Asp Glu Asp Ala Gln Ile
545 550 555 560

Gln Glu Met Glu Ser Gly Glu Gln Ala Ala Phe Met Gln Gln Pro Phe
565 570 575

Leu Glu Pro Thr His Thr Arg Ala Leu Ser Val Arg Gln Ala Pro Leu
580 585 590

Ala Ala Val Gly Met Asp Gly Leu Glu Lys His Arg Leu Val Ser Arg
595 600 605

Thr His Ser Ser Pro Ala Ala Ser Val Leu Pro His Pro Ala Met Asp
610 615 620

Arg Pro Leu Gln Pro Gly Ser Ala Thr Gly Ile Ala Tyr Asp Pro Leu
625 630 635 640

Met Leu Lys His Gln Cys Val Cys Gly Asn Ser Thr Thr His Pro Glu
645 650 655

His Ala Gly Arg Ile Gln Ser Ile Trp Ser Arg Leu Gln Glu Thr Gly
660 665 670

Leu Leu Asn Lys Cys Glu Arg Ile Gln Gly Arg Lys Ala Ser Leu Glu
675 680 685

Glu Ile Gln Leu Val His Ser Glu His His Ser Leu Leu Tyr Gly Thr
690 695 700

Asn Pro Leu Asp Gly Gln Lys Leu Asp Pro Arg Ile Leu Leu Gly Asp
705 710 715 720

Asp Ser Gln Lys Phe Phe Ser Ser Leu Pro Cys Gly Gly Leu Gly Val
725 730 735

Asp Ser Asp Thr Ile Trp Asn Glu Leu His Ser Ser Gly Ala Ala Arg
740 745 750

Met Ala Val Gly Cys Val Ile Glu Leu Ala Ser Lys Val Ala Ser Gly
755 760 765

Glu Leu Lys Asn Gly Phe Ala Val Val Arg Pro Pro Gly His His Ala
770 775 780

Glu Glu Ser Thr Ala Met Gly Phe Cys Phe Phe Asn Ser Val Ala Ile
785 790 795 800

Thr Ala Lys Tyr Leu Arg Asp Gln Leu Asn Ile Ser Lys Ile Leu Ile
805 810 815

Val Asp Leu Asp Val His His Gly Asn Gly Thr Gln Gln Ala Phe Tyr
820 825 830

Ala Asp Pro Ser Ile Leu Tyr Ile Ser Leu His Arg Tyr Asp Glu Gly
835 840 845

Asn Phe Phe Pro Gly Ser Gly Ala Pro Asn Glu Val Gly Thr Gly Leu
850 855 860

Gly Glu Gly Tyr Asn Ile Asn Ile Ala Trp Thr Gly Gly Leu Asp Pro
865 870 875 880

Pro Met Gly Asp Val Glu Tyr Leu Glu Ala Phe Arg Thr Ile Val Lys
885 890 895

Pro Val Ala Lys Glu Phe Asp Pro Asp Met Val Leu Val Ser Ala Gly
900 905 910

Phe Asp Ala Leu Glu Gly His Thr Pro Pro Leu Gly Gly Tyr Lys Val
915 920 925

Thr Ala Lys Cys Phe Gly His Leu Thr Lys Gln Leu Met Thr Leu Ala
930 935 940

Asp Gly Arg Val Val Leu Ala Leu Glu Gly Gly His Asp Leu Thr Ala
945 950 955 960

Ile Cys Asp Ala Ser Glu Ala Cys Val Asn Ala Leu Leu Gly Asn Glu
965 970 975

Leu Glu Pro Leu Ala Glu Asp Ile Leu His Gln Ser Pro Asn Met Asn
980 985 990

Ala Val Ile Ser Leu Gln Lys Ile Ile Glu Ile Gln Ser Met Ser Leu
995 1000 1005

Lys Phe Ser
1010



<210> 3 
<211> 3499 
<212> DNA 

<213> Homo sapiens 
<400> 3 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780
gatgatttcc cccttcgaaa aactgcctct gagcccaact tgaaggtgcg gtccaggtta 840
aaacagaaag tggcagagag gagaagcagc cccttactca ggcggaagga tggaaatgtt 900
gtcacttcat tcaagaagcg aatgtttgag gtgacagaat cctcagtcag tagcagttct 960
ccaggctctg gtcccagttc accaaacaat gggccaactg gaagtgttac tgaaaatgag 1020
acttcggttt tgccccctac ccctcatgcc gagcaaatgg tttcacagca acgcattcta 1080
attcatgaag attccatgaa cctgctaagt ctttatacct ctccttcttt gcccaacatt 1140
accttggggc ttcccgcagt gccatcccag ctcaatgctt cgaattcact caaagaaaag 1200
cagaagtgtg agacgcagac gcttaggcaa ggtgttcctc tgcctgggca gtatggaggc 1260
agcatcccgg catcttccag ccaccctcat gttactttag agggaaagcc acccaacagc 1320
agccaccagg ctctcctgca gcatttatta ttgaaagaac aaatgcgaca gcaaaagctt 1380
cttgtagctg gtggagttcc cttacatcct cagtctccct tggcaacaaa agagagaatt 1440
tcacctggca ttagaggtac ccacaaattg ccccgtcaca gacccctgaa ccgaacccag 1500
tctgcacctt tgcctcagag cacgttggct cagctggtca ttcaacagca acaccagcaa 1560
ttcttggaga agcagaagca ataccagcag cagatccaca tgaacaaact gctttcgaaa 1620
tctattgaac aactgaagca accaggcagt caccttgagg aagcagagga agagcttcag 1680
ggggaccagg cgatgcagga agacagagcg ccctctagtg gcaacagcac taggagcgac 1740
agcagtgctt gtgtggatga cacactggga caagttgggg ctgtgaaggt caaggaggaa 1800
ccagtggaca gtgatgaaga tgctcagatc caggaaatgg aatctgggga gcaggctgct 1860
tttatgcaac agcctttcct ggaacccacg cacacacgtg cgctctctgt gcgccaagct 1920
ccgctggctg cggttggcat ggatggatta gagaaacacc gtctcgtctc caggactcac 1980
tcttcccctg ctgcctctgt tttacctcac ccagcaatgg accgccccct ccagcctggc 2040
tctgcaactg gaattgccta tgaccccttg atgctgaaac accagtgcgt ttgtggcaat 2100
tccaccaccc accctgagca tgctggacga atacagagta tctggtcacg actgcaagaa 2160
actgggctgc taaataaatg tgagcgaatt caaggtcgaa aagccagcct ggaggaaata 2220
cagcttgttc attctgaaca tcactcactg ttgtatggca ccaaccccct ggacggacag 2280
aagctggacc ccaggatact cctaggtgat gactctcaaa agtttttttc ctcattacct 2340
tgtggtggac ttggggtgga cagtgacacc atttggaatg agctacactc gtccggtgct 2400
gcacgcatgg ctgttggctg tgtcatcgag ctggcttcca aagtggcctc aggagagctg 2460
aagaatgggt ttgctgttgt gaggccccct ggccatcacg ctgaagaatc cacagccatg 2520
gggttctgct tttttaattc agttgcaatt accgccaaat acttgagaga ccaactaaat 2580
ataagcaaga tattgattgt agatctggat gttcaccatg gaaacggtac ccagcaggcc 2640
ttttatgctg accccagcat cctgtacatt tcactccatc gctatgatga agggaacttt 2700
ttccctggca gtggagcccc aaatgaggtt cggtttattt ctttagagcc ccacttttat 2760
ttgtatcttt caggtaattg cattgcatga ttacccctaa ttttcttgtc ctttgctggt 2820
gttttaaatt acacgagatt actgaattgt cccatgggac caagaaccag tgcagaacaa 2880
gtgcataacc cagagcactg tttgtcaggg aaggttgggc tgatttgatg tgttgtttga 2940
tgtttatttc aagagctccc atgtgcttgt tttcctctct tcttgctttc ttccatttgc 3000
tctcttctct gcccaccgtg gtgtgtcttt ctcttcccag gttggaacag gccttggaga 3060
agggtacaat ataaatattg cctggacagg tggccttgat cctcccatgg gagatgttga 3120
gtaccttgaa gcattcagga ccatcgtgaa gcctgtggcc aaagagtttg atccagacat 3180
ggtcttagta tctgctggat ttgatgcatt ggaaggccac acccctcctc taggagggta 3240
caaagtgacg gcaaaatgtt ttggtcattt gacgaagcaa ttgatgacat tggctgatgg 3300
acgtgtggtg ttggctctag aaggaggaca tgatctcaca gccatctgtg atgcatcaga 3360
agcctgtgta aatgcccttc taggaaatga gctggagcca cttgcagaag atattctcca 3420
ccaaagcccg aatatgaatg ctgttatttc tttacagaag atcattgaaa ttcaaagtat 3480
gtctttaaag ttctcttaa 3499



<210> 4 
<211> 879 
<212> PRT 

<213> Homo sapiens 



<400> 4 



Met His Ser Met Ile Ser Ser Val Asp Val Lys Ser Glu Val Pro Val
1 5 10 15

Gly Leu Glu Pro Ile Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met
20 25 30

Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gln Leu Gln Gln
35 40 45

Glu Leu Leu Leu Ile Gln Gln Gln Gln Gln Ile Gln Lys Gln Leu Leu
50 55 60

Ile Ala Glu Phe Gln Lys Gln His Glu Asn Leu Thr Arg Gln His Gln
65 70 75 80

Ala Gln Leu Gln Glu His Ile Lys Glu Leu Leu Ala Ile Lys Gln Gln
85 90 95

Gln Glu Leu Leu Glu Lys Glu Gln Lys Leu Glu Gln Gln Arg Gln Glu
100 105 110

Gln Glu Val Glu Arg His Arg Arg Glu Gln Gln Leu Pro Pro Leu Arg
115 120 125

Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lys
130 135 140

Gln Lys Leu Gln Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr
145 150 155 160

Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp
165 170 175

Tyr Thr Ala Ala His His Thr Ser Leu Asp Gln Ser Ser Pro Pro Leu
180 185 190

Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly Ala Gln Asp
195 200 205

Ala Lys Asp Asp Phe Pro Leu Arg Lys Thr Ala Ser Glu Pro Asn Leu
210 215 220

Lys Val Arg Ser Arg Leu Lys Gln Lys Val Ala Glu Arg Arg Ser Ser
225 230 235 240

Pro Leu Leu Arg Arg Lys Asp Gly Asn Val Val Thr Ser Phe Lys Lys
245 250 255

Arg Met Phe Glu Val Thr Glu Ser Ser Val Ser Ser Ser Ser Pro Gly
260 265 270

Ser Gly Pro Ser Ser Pro Asn Asn Gly Pro Thr Gly Ser Val Thr Glu
275 280 285

Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala Glu Gln Met Val
290 295 300

Ser Gln Gln Arg Ile Leu Ile His Glu Asp Ser Met Asn Leu Leu Ser
305 310 315 320

Leu Tyr Thr Ser Pro Ser Leu Pro Asn Ile Thr Leu Gly Leu Pro Ala
325 330 335

Val Pro Ser Gln Leu Asn Ala Ser Asn Ser Leu Lys Glu Lys Gln Lys
340 345 350

Cys Glu Thr Gln Thr Leu Arg Gln Gly Val Pro Leu Pro Gly Gln Tyr
355 360 365

Gly Gly Ser Ile Pro Ala Ser Ser Ser His Pro His Val Thr Leu Glu
370 375 380

Gly Lys Pro Pro Asn Ser Ser His Gln Ala Leu Leu Gln His Leu Leu
385 390 395 400

Leu Lys Glu Gln Met Arg Gln Gln Lys Leu Leu Val Ala Gly Gly Val
405 410 415

Pro Leu His Pro Gln Ser Pro Leu Ala Thr Lys Glu Arg Ile Ser Pro
420 425 430

Gly Ile Arg Gly Thr His Lys Leu Pro Arg His Arg Pro Leu Asn Arg
435 440 445

Thr Gln Ser Ala Pro Leu Pro Gln Ser Thr Leu Ala Gln Leu Val Ile
450 455 460

Gln Gln Gln His Gln Gln Phe Leu Glu Lys Gln Lys Gln Tyr Gln Gln
465 470 475 480

Gln Ile His Met Asn Lys Leu Leu Ser Lys Ser Ile Glu Gln Leu Lys
485 490 495

Gln Pro Gly Ser His Leu Glu Glu Ala Glu Glu Glu Leu Gln Gly Asp
500 505 510

Gln Ala Met Gln Glu Asp Arg Ala Pro Ser Ser Gly Asn Ser Thr Arg
515 520 525

Ser Asp Ser Ser Ala Cys Val Asp Asp Thr Leu Gly Gln Val Gly Ala
530 535 540

Val Lys Val Lys Glu Glu Pro Val Asp Ser Asp Glu Asp Ala Gln Ile
545 550 555 560

Gln Glu Met Glu Ser Gly Glu Gln Ala Ala Phe Met Gln Gln Pro Phe
565 570 575

Leu Glu Pro Thr His Thr Arg Ala Leu Ser Val Arg Gln Ala Pro Leu
580 585 590

Ala Ala Val Gly Met Asp Gly Leu Glu Lys His Arg Leu Val Ser Arg
595 600 605

Thr His Ser Ser Pro Ala Ala Ser Val Leu Pro His Pro Ala Met Asp
610 615 620

Arg Pro Leu Gln Pro Gly Ser Ala Thr Gly Ile Ala Tyr Asp Pro Leu
625 630 635 640

Met Leu Lys His Gln Cys Val Cys Gly Asn Ser Thr Thr His Pro Glu
645 650 655

His Ala Gly Arg Ile Gln Ser Ile Trp Ser Arg Leu Gln Glu Thr Gly
660 665 670

Leu Leu Asn Lys Cys Glu Arg Ile Gln Gly Arg Lys Ala Ser Leu Glu
675 680 685

Glu Ile Gln Leu Val His Ser Glu His His Ser Leu Leu Tyr Gly Thr
690 695 700

Asn Pro Leu Asp Gly Gln Lys Leu Asp Pro Arg Ile Leu Leu Gly Asp
705 710 715 720

Asp Ser Gln Lys Phe Phe Ser Ser Leu Pro Cys Gly Gly Leu Gly Val
725 730 735

Asp Ser Asp Thr Ile Trp Asn Glu Leu His Ser Ser Gly Ala Ala Arg
740 745 750

Met Ala Val Gly Cys Val Ile Glu Leu Ala Ser Lys Val Ala Ser Gly
755 760 765

Glu Leu Lys Asn Gly Phe Ala Val Val Arg Pro Pro Gly His His Ala
770 775 780

Glu Glu Ser Thr Ala Met Gly Phe Cys Phe Phe Asn Ser Val Ala Ile
785 790 795 800

Thr Ala Lys Tyr Leu Arg Asp Gln Leu Asn Ile Ser Lys Ile Leu Ile
805 810 815

Val Asp Leu Asp Val His His Gly Asn Gly Thr Gln Gln Ala Phe Tyr
820 825 830

Ala Asp Pro Ser Ile Leu Tyr Ile Ser Leu His Arg Tyr Asp Glu Gly
835 840 845

Asn Phe Phe Pro Gly Ser Gly Ala Pro Asn Glu Val Arg Phe Ile Ser
850 855 860

Leu Glu Pro His Phe Tyr Leu Tyr Leu Ser Gly Asn Cys Ile Ala
865 870 875



<210> 5 

<211> 3054 

<212> DNA 

<213> Homo sapiens 

<400> 5 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780
gatgatttcc cccttcgaaa aactgaatcc tcagtcagta gcagttctcc aggctctggt 840
cccagttcac caaacaatgg gccaactgga agtgttactg aaaatgagac ttcggttttg 900
ccccctaccc ctcatgccga gcaaatggtt tcacagcaac gcattctaat tcatgaagat 960
tccatgaacc tgctaagtct ttatacctct ccttctttgc ccaacattac cttggggctt 1020
cccgcagtgc catcccagct caatgcttcg aattcactca aagaaaagca gaagtgtgag 1080
acgcagacgc ttaggcaagg tgttcctctg cctgggcagt atggaggcag catcccggca 1140
tcttccagcc accctcatgt tactttagag ggaaagccac ccaacagcag ccaccaggct 1200
ctcctgcagc atttattatt gaaagaacaa atgcgacagc aaaagcttct tgtagctggt 1260
ggagttccct tacatcctca gtctcccttg gcaacaaaag agagaatttc acctggcatt 1320 
agaggtaccc acaaattgcc ccgtcacaga cccctgaacc gaacccagtc tgcacctttg 1380 
cctcagagca cgttggctca gctggtcatt caacagcaac accagcaatt cttggagaag 1440 
cagaagcaat accagcagca gatccacatg aacaaactgc tttcgaaatc tattgaacaa 1500 
ctgaagcaac caggcagtca ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg 1560 
atgcaggaag acagagcgcc ctctagtggc aacagcacta ggagcgacag cagtgcttgt 1620 
gtggatgaca cactgggaca agttggggct gtgaaggtca aggaggaacc agtggacagt 1680 
gatgaagatg ctcagatcca ggaaatggaa tctggggagc aggctgcttt tatgcaacag 1740 
cctttcctgg aacccacgca cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg 1800 
gttggcatgg atggattaga gaaacaccgt ctcgtctcca ggactcactc ttcccctgct 1860 
gcctctgttt tacctcaccc agcaatggac cgccccctcc agcctggctc tgcaactgga 1920 
attgcctatg accccttgat gctgaaacac cagtgcgttt gtggcaattc caccacccac 1980 
cctgagcatg ctggacgaat acagagtatc tggtcacgac tgcaagaaac tgggctgcta 2040 
aataaatgtg agcgaattca aggtcgaaaa gccagcctgg aggaaataca gcttgttcat 2100 
tctgaacatc actcactgtt gtatggcacc aaccccctgg acggacagaa gctggacccc 2160 
aggatactcc taggtgatga ctctcaaaag tttttttcct cattaccttg tggtggactt 2220 
ggggtggaca gtgacaccat ttggaatgag ctacactcgt ccggtgctgc acgcatggct 2280 
gttggctgtg tcatcgagct ggcttccaaa gtggcctcag gagagctgaa gaatgggttt 2340
gctgttgtga ggccccctgg ccatcacgct gaagaatcca cagccatggg gttctgcttt 2400 
tttaattcag ttgcaattac cgccaaatac ttgagagacc aactaaatat aagcaagata 2460 
ttgattgtag atctggatgt tcaccatgga aacggtaccc agcaggcctt ttatgctgac 2520 
cccagcatcc tgtacatttc actccatcgc tatgatgaag ggaacttttt ccctggcagt 2580 
ggagccccaa atgaggttgg aacaggcctt ggagaagggt acaatataaa tattgcctgg 2640 
acaggtggcc ttgatcctcc catgggagat gttgagtacc ttgaagcatt caggaccatc 2700 
gtgaagcctg tggccaaaga gtttgatcca gacatggtct tagtatctgc tggatttgat 2760 
gcattggaag gccacacccc tcctctagga gggtacaaag tgacggcaaa atgttttggt 2820 
catttgacga agcaattgat gacattggct gatggacgtg tggtgttggc tctagaagga 2880 
ggacatgatc tcacagccat ctgtgatgca tcagaagcct gtgtaaatgc ccttctagga 2940
aatgagctgg agccacttgc agaagatatt ctccaccaaa gcccgaatat gaatgctgtt 3000
atttctttac agaagatcat tgaaattcaa agtatgtctt taaagttctc ttaa 3054 

<210> 6 
<211> 967 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Met His Ser Met Ile Ser Ser Val Asp Val Lys Ser Glu Val Pro Val
1 5 10 15

Gly Leu Glu Pro Ile Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met
20 25 30

Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gln Leu Gln Gln
35 40 45

Glu Leu Leu Leu Ile Gln Gln Gln Gln Gln Ile Gln Lys Gln Leu Leu
50 55 60

Ile Ala Glu Phe Gln Lys Gln His Glu Asn Leu Thr Arg Gln His Gln
65 70 75 80

Ala Gln Leu Gln Glu His Ile Lys Glu Leu Leu Ala Ile Lys Gln Gln
85 90 95

Gln Glu Leu Leu Glu Lys Glu Gln Lys Leu Glu Gln Gln Arg Gln Glu
100 105 110

Gln Glu Val Glu Arg His Arg Arg Glu Gln Gln Leu Pro Pro Leu Arg
115 120 125

Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lys
130 135 140

Gln Lys Leu Gln Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr
145 150 155 160

Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp
165 170 175

Tyr Thr Ala Ala His His Thr Ser Leu Asp Gln Ser Ser Pro Pro Leu
180 185 190

Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly Ala Gln Asp
195 200 205

Ala Lys Asp Asp Phe Pro Leu Arg Lys Thr Glu Ser Ser Val Ser Ser
210 215 220

Ser Ser Pro Gly Ser Gly Pro Ser Ser Pro Asn Asn Gly Pro Thr Gly
225 230 235 240

Ser Val Thr Glu Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala
245 250 255

Glu Gln Met Val Ser Gln Gln Arg Ile Leu Ile His Glu Asp Ser Met
260 265 270

Asn Leu Leu Ser Leu Tyr Thr Ser Pro Ser Leu Pro Asn Ile Thr Leu
275 280 285

Gly Leu Pro Ala Val Pro Ser Gln Leu Asn Ala Ser Asn Ser Leu Lys
290 295 300

Glu Lys Gln Lys Cys Glu Thr Gln Thr Leu Arg Gln Gly Val Pro Leu
305 310 315 320

Pro Gly Gln Tyr Gly Gly Ser Ile Pro Ala Ser Ser Ser His Pro His
325 330 335

Val Thr Leu Glu Gly Lys Pro Pro Asn Ser Ser His Gln Ala Leu Leu
340 345 350

Gln His Leu Leu Leu Lys Glu Gln Met Arg Gln Gln Lys Leu Leu Val
355 360 365

Ala Gly Gly Val Pro Leu His Pro Gln Ser Pro Leu Ala Thr Lys Glu
370 375 380

Arg Ile Ser Pro Gly Ile Arg Gly Thr His Lys Leu Pro Arg His Arg
385 390 395 400

Pro Leu Asn Arg Thr Gln Ser Ala Pro Leu Pro Gln Ser Thr Leu Ala
405 410 415

Gln Leu Val Ile Gln Gln Gln His Gln Gln Phe Leu Glu Lys Gln Lys
420 425 430

Gln Tyr Gln Gln Gln Ile His Met Asn Lys Leu Leu Ser Lys Ser Ile
435 440 445

Glu Gln Leu Lys Gln Pro Gly Ser His Leu Glu Glu Ala Glu Glu Glu
450 455 460

Leu Gln Gly Asp Gln Ala Met Gln Glu Asp Arg Ala Pro Ser Ser Gly
465 470 475 480

Asn Ser Thr Arg Ser Asp Ser Ser Ala Cys Val Asp Asp Thr Leu Gly
485 490 495

Gln Val Gly Ala Val Lys Val Lys Glu Glu Pro Val Asp Ser Asp Glu
500 505 510

Asp Ala Gln Ile Gln Glu Met Glu Ser Gly Glu Gln Ala Ala Phe Met
515 520 525

Gln Gln Pro Phe Leu Glu Pro Thr His Thr Arg Ala Leu Ser Val Arg
530 535 540

Gln Ala Pro Leu Ala Ala Val Gly Met Asp Gly Leu Glu Lys His Arg
545 550 555 560

Leu Val Ser Arg Thr His Ser Ser Pro Ala Ala Ser Val Leu Pro His
565 570 575

Pro Ala Met Asp Arg Pro Leu Gln Pro Gly Ser Ala Thr Gly Ile Ala
580 585 590

Tyr Asp Pro Leu Met Leu Lys His Gln Cys Val Cys Gly Asn Ser Thr
595 600 605

Thr His Pro Glu His Ala Gly Arg Ile Gln Ser Ile Trp Ser Arg Leu
610 615 620

Gln Glu Thr Gly Leu Leu Asn Lys Cys Glu Arg Ile Gln Gly Arg Lys
625 630 635 640

Ala Ser Leu Glu Glu Ile Gln Leu Val His Ser Glu His His Ser Leu
645 650 655

Leu Tyr Gly Thr Asn Pro Leu Asp Gly Gln Lys Leu Asp Pro Arg Ile
660 665 670

Leu Leu Gly Asp Asp Ser Gln Lys Phe Phe Ser Ser Leu Pro Cys Gly
675 680 685

Gly Leu Gly Val Asp Ser Asp Thr Ile Trp Asn Glu Leu His Ser Ser
690 695 700

Gly Ala Ala Arg Met Ala Val Gly Cys Val Ile Glu Leu Ala Ser Lys
705 710 715 720

Val Ala Ser Gly Glu Leu Lys Asn Gly Phe Ala Val Val Arg Pro Pro
725 730 735

Gly His His Ala Glu Glu Ser Thr Ala Met Gly Phe Cys Phe Phe Asn
740 745 750

Ser Val Ala Ile Thr Ala Lys Tyr Leu Arg Asp Gln Leu Asn Ile Ser
755 760 765

Lys Ile Leu Ile Val Asp Leu Asp Val His His Gly Asn Gly Thr Gln
770 775 780

Gln Ala Phe Tyr Ala Asp Pro Ser Ile Leu Tyr Ile Ser Leu His Arg
785 790 795 800

Tyr Asp Glu Gly Asn Phe Phe Pro Gly Ser Gly Ala Pro Asn Glu Val
805 810 815

Gly Thr Gly Leu Gly Glu Gly Tyr Asn Ile Asn Ile Ala Trp Thr Gly
820 825 830

Gly Leu Asp Pro Pro Met Gly Asp Val Glu Tyr Leu Glu Ala Phe Arg
835 840 845

Thr Ile Val Lys Pro Val Ala Lys Glu Phe Asp Pro Asp Met Val Leu
850 855 860

Val Ser Ala Gly Phe Asp Ala Leu Glu Gly His Thr Pro Pro Leu Gly
865 870 875 880

Gly Tyr Lys Val Thr Ala Lys Cys Phe Gly His Leu Thr Lys Gln Leu
885 890 895

Met Thr Leu Ala Asp Gly Arg Val Val Leu Ala Leu Glu Gly Gly His
900 905 910

Asp Leu Thr Ala Ile Cys Asp Ala Ser Glu Ala Cys Val Asn Ala Leu
915 920 925

Leu Gly Asn Glu Leu Glu Pro Leu Ala Glu Asp Ile Leu His Gln Ser
930 935 940

Pro Asn Met Asn Ala Val Ile Ser Leu Gln Lys Ile Ile Glu Ile Gln
945 950 955 960

Ser Met Ser Leu Lys Phe Ser
965



<210> 7 

<211> 3367 

<212> DNA 

<213> Homo sapiens 

<400> 7 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60 
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120 
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180 
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240 
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300 
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360 
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420 
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480 
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540 
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600 
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660 
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720 
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780 
gatgatttcc cccttcgaaa aactgaatcc tcagtcagta gcagttctcc aggctctggt 840 
cccagttcac caaacaatgg gccaactgga agtgttactg aaaatgagac ttcggttttg 900 
ccccctaccc ctcatgccga gcaaatggtt tcacagcaac gcattctaat tcatgaagat 960 
tccatgaacc tgctaagtct ttatacctct ccttctttgc ccaacattac cttggggctt 1020 
cccgcagtgc catcccagct caatgcttcg aattcactca aagaaaagca gaagtgtgag 1080 
acgcagacgc ttaggcaagg tgttcctctg cctgggcagt atggaggcag catcccggca 1140 
tcttccagcc accctcatgt tactttagag ggaaagccac ccaacagcag ccaccaggct 1200 
ctcctgcagc atttattatt gaaagaacaa atgcgacagc aaaagcttct tgtagctggt 1260 
ggagttccct tacatcctca gtctcccttg gcaacaaaag agagaatttc acctggcatt 1320 
agaggtaccc acaaattgcc ccgtcacaga cccctgaacc gaacccagtc tgcacctttg 1380 
cctcagagca cgttggctca gctggtcatt caacagcaac accagcaatt cttggagaag 1440 
cagaagcaat accagcagca gatccacatg aacaaactgc tttcgaaatc tattgaacaa 1500 
ctgaagcaac caggcagtca ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg 1560 
atgcaggaag acagagcgcc ctctagtggc aacagcacta ggagcgacag cagtgcttgt 1620 
gtggatgaca cactgggaca agttggggct gtgaaggtca aggaggaacc agtggacagt 1680 
gatgaagatg ctcagatcca ggaaatggaa tctggggagc aggctgcttt tatgcaacag 1740 
cctttcctgg aacccacgca cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg 1800 
gttggcatgg atggattaga gaaacaccgt ctcgtctcca ggactcactc ttcccctgct 1860 
gcctctgttt tacctcaccc agcaatggac cgccccctcc agcctggctc tgcaactgga 1920 
attgcctatg accccttgat gctgaaacac cagtgcgttt gtggcaattc caccacccac 1980 
cctgagcatg ctggacgaat acagagtatc tggtcacgac tgcaagaaac tgggctgcta 2040 
aataaatgtg agcgaattca aggtcgaaaa gccagcctgg aggaaataca gcttgttcat 2100 
tctgaacatc actcactgtt gtatggcacc aaccccctgg acggacagaa gctggacccc 2160 
aggatactcc taggtgatga ctctcaaaag tttttttcct cattaccttg tggtggactt 2220 
ggggtggaca gtgacaccat ttggaatgag ctacactcgt ccggtgctgc acgcatggct 2280 
gttggctgtg tcatcgagct ggcttccaaa gtggcctcag gagagctgaa gaatgggttt 2340 
gctgttgtga ggccccctgg ccatcacgct gaagaatcca cagccatggg gttctgcttt 2400 
tttaattcag ttgcaattac cgccaaatac ttgagagacc aactaaatat aagcaagata 2460 
ttgattgtag atctggatgt tcaccatgga aacggtaccc agcaggcctt ttatgctgac 2520 
cccagcatcc tgtacatttc actccatcgc tatgatgaag ggaacttttt ccctggcagt 2580 
ggagccccaa atgaggttcg gtttatttct ttagagcccc acttttattt gtatctttca 2640 
ggtaattgca ttgcatgatt acccctaatt ttcttgtcct ttgctggtgt tttaaattac 2700 
acgagattac tgaattgtcc catgggacca agaaccagtg cagaacaagt gcataaccca 2760 
gagcactgtt tgtcagggaa ggttgggctg atttgatgtg ttgtttgatg tttatttcaa 2820 
gagctcccat gtgcttgttt tcctctcttc ttgctttctt ccatttgctc tcttctctgc 2880 
ccaccgtggt gtgtctttct cttcccaggt tggaacaggc cttggagaag ggtacaatat 2940 
aaatattgcc tggacaggtg gccttgatcc tcccatggga gatgttgagt accttgaagc 3000 
attcaggacc atcgtgaagc ctgtggccaa agagtttgat ccagacatgg tcttagtatc 3060 
tgctggattt gatgcattgg aaggccacac ccctcctcta ggagggtaca aagtgacggc 3120 
aaaatgtttt ggtcatttga cgaagcaatt gatgacattg gctgatggac gtgtggtgtt 3180 
ggctctagaa ggaggacatg atctcacagc catctgtgat gcatcagaag cctgtgtaaa 3240 
tgcccttcta ggaaatgagc tggagccact tgcagaagat attctccacc aaagcccgaa 3300 
tatgaatgct gttatttctt tacagaagat cattgaaatt caaagtatgt ctttaaagtt 3360 
ctcttaa 3367 

<210> 8 
<211> 835 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Met His Ser Met Ile Ser Ser Val Asp Val Lys Ser Glu Val Pro Val 

1 5 10 15 

Gly Leu Glu Pro Ile Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met 

20 25 30 

Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gln Leu Gln Gln 

35 40 45 

Glu Leu Leu Leu Ile Gln Gln Gln Gln Gln Ile Gln Lys Gln Leu Leu 

50 55 60 

Ile Ala Glu Phe Gln Lys Gln His Glu Asn Leu Thr Arg Gln His Gln 
65 70 75 80 

Ala Gln Leu Gln Glu His Ile Lys Glu Leu Leu Ala Ile Lys Gln Gln 

85 90 95 

Gln Glu Leu Leu Glu Lys Glu Gln Lys Leu Glu Gln Gln Arg Gln Glu 

100 105 110 

Gln Glu Val Glu Arg His Arg Arg Glu Gln Gln Leu Pro Pro Leu Arg 
115 120 125 
Gly Lys 
130 
Gin Lys 
145 

Pro Thr 

Tyr Thr 

Ser Gly 

Ala Lys 
210 
Ser Ser 
225 

Ser Val 

Glu Gin 

Asn Leu 

Gly Leu 
290 
Glu Lys 
305 

Pro Gly 

Val Thr 

Gin His 

Ala Gly 
370 
Arg He 
385 

Pro Leu 



Asp Arg Gly Arg 
Leu Gin 
Asn Gly 



Ala Ala 
180 
Thr Ser 
195 

Asp Asp 

Pro Gly 

Thr Glu 

Met Val 
260 
Leu Ser 
275 

Pro Ala 

Gin Lys 

Gin Tyr 

Leu Glu 
340 
Leu Leu 
355 

Gly Val 
Ser Pro 
Asn Arg 



Glu Phe 
150 
Lys Asn 
165 

His His 



Gin Leu 

Gin Tyr 

Glu Gin 
450 
Leu Gin 
465 

Asn Ser 

Gin Val 

Asp Ala 

Gin Gin 
530 
Gin Ala 
545 

Leu Val 

Pro Ala 

Tyr Asp 

Thr His 
610 



Val He 
420 
Gin Gin 
435 

Leu Lys 

Gly Asp 

Thr Arg 

Gly Ala 
500 
Gin He 
515 

Pro Phe 

Pro Leu 

Ser Arg 

Met Asp 
580 
Pro Leu 
595 

Pro Glu 



Pro Ser 

Phe Pro 

Ser Gly 
230 
Asn Glu 
245 

Ser Gin 

Leu Tyr 

Val Pro 

Cys Glu 
310 
Gly Gly 
325 

Gly Lys 

Leu Lys 

Pro Leu 

Gly He 
390 
Thr Gin 
405 

Gin Gin 



Glu Arg 
135 

Leu Leu 

His Ser 

Thr Ser 

Tyr Lys 
200 
Leu Arg 
215 

Pro Ser 

Thr Ser 

Gln Arg 

Thr Ser 
280 
Ser Gin 
295 

Thr Gin 
Ser He 
Pro Pro 



Glu Gin 
360 
His Pro 
375 

Arg Gly 
Ser Ala 
Gin His 



Ala Val 

Ser Lys 

Val Ser 
170 
Leu Asp 
185 

Tyr Thr 

Lys Thr 

Ser Pro 

Val Leu 
250 
He Leu 
265 

Pro Ser 

Leu Asn 

Thr Leu 

Pro Ala 
330 
Asn Ser 
345 

Met Arg 



Gin He 

Gin Pro 

Gin Ala 
470 
Ser Asp 
485 

Val Lys 

Gin Glu 

Leu Glu 

Ala Ala 
550 
Thr His 
565 

Arg Pro 
Met Leu 
His Ala 



His Met 
440 
Gly Ser 
455 

Met Gin 

Ser Ser 

Val Lys 

Met Glu 
520 
Pro Thr 
535 

Val Gly 

Ser Ser 

Leu Gin 

Lys His 
600 
Gly Arg 
615 



Gin Ser 

Thr His 

Pro Leu 
410 
Gin Gin 
425 

Asn Lys 

His Leu 

Glu Asp 

Ala Cys 
490 
Glu Glu 
505 

Ser Gly 

His Thr 

Met Asp 

Pro Ala 
570 
Pro Gly 
585 

Gin Cys 
lie Gin 



Ala Ser 
140 
Ser Ala 
155 

Arg His 

Gin Ser 

Leu Pro 

Glu Ser 
220 
Asn Asn 
235 

Pro Pro 

lie His 

Leu Pro 

Ala Ser 
300 
Arg Gin 
315 

Ser Ser 

Ser His 

Gin Gin 

Pro Leu 
380 
Lys Leu 
395 

Pro Gin 



Thr Glu Val Lys 
Thr Lys 
Pro Lys 



Ser Pro 
190 
Gly Ala 
205 

Ser Val 

Gly Pro 

Thr Pro 

Glu Asp 
270 
Asn He 
285 

Asn Ser 

Gly Val 

Ser His 

Gin Ala 
350 
Lys Leu 
365 

Ala Thr 
Pro Arg 
Ser Thr 



Asp Thr 
160 
Leu Trp 
175 

Pro Leu 



Phe Leu 

Leu Leu 

Glu Glu 
460 
Arg Ala 
475 

Val Asp 

Pro Val 

Glu Gin 

Arg Ala 
540 
Gly Leu 
555 

Ala Ser 

Ser Ala 

Val Cys 

Ser He 
620 



Glu Lys 
430 
Ser Lys 
445 

Ala Glu 

Pro Ser 

Asp Thr 

Asp Ser 
510 
Ala Ala 
525 

Leu Ser 

Glu Lys 

Val Leu 

Thr Gly 
590 
Gly Asn 
605 

Trp Ser 



Gin Asp 

Ser Ser 

Thr Gly 
240 
His Ala 
255 

Ser Met 

Thr Leu 

Leu Lys 

Pro Leu 
320 
Pro His 
335 

Leu Leu 

Leu Val 

Lys Glu 

His Arg 
400 
Leu Ala 
415 

Gin Lys 

Ser lie 

Glu Glu 

Ser Gly 
480 
Leu Gly 
495 

Asp Glu 

Phe Met 

Val Arg 

His Arg 
560 
Pro His 
575 

lie Ala 
Ser Thr 
Arg Leu 
Gln 


Glu 


Thr 


Gly 


Leu Leu Asn 


Lys Cys 


Glu 


Arg He 


Gin Gly Arg Lys 


625 








630 






635 




640 


Ala 


Ser 


Leu 


Glu 


Glu He Gin 


Leu Val 


His 


Ser Glu 


His His 


Ser Leu 










645 




650 






655 


Leu 


Tyr 


Gly 


Thr 


Asn Pro Leu Asp Gly 


Gin 


Lys Leu 


Asp Pro 


Arg He 








660 




665 






670 




Leu 


Leu 


Gly 


Asp 


Asp Ser Gin 


Lys Phe 


Phe 


Ser Ser 


Leu Pro 


Cys Gly 






675 






680 






685 




Gly 


Leu 


Gly 


Val 


Asp Ser Asp 


Thr He 


Trp 


Asn Glu 


Leu His 


Ser Ser 




690 






695 






700 






Gly 


Ala 


Ala 


Arg 


Met Ala Val 


Gly Cys 


Val 


He Glu 


Leu Ala 


Ser Lys 


705 








710 






715 




720 


Val 


Ala 


Ser 


Gly 


Glu Leu Lys 


Asn Gly 


Phe 


Ala Val 


Val Arg 


Pro Pro 










725 




730 






735 


Gly 


His 


His 


Ala 


Glu Glu Ser 


Thr Ala 


Met 


Gly Phe 


Cys Phe 


Phe Asn 








740 




745 






750 




Ser 


Val 


Ala 


He 


Thr Ala Lys 


Tyr Leu 


Arg 


Asp Gin 


Leu Asn 


He Ser 






755 






760 






765 




Lys 


lie 


Leu 


He 


Val Asp Leu Asp Val 


His 


His Gly 


Asn Gly Thr Gin 




770 






775 






780 






Gin 


Ala 


Phe 


Tyr 


Ala Asp Pro 


Ser He 


Leu 


Tyr He 


Ser Leu 


His Arg 


785 








790 






795 




800 


Tyr 


Asp 


Glu 


Gly 


Asn Phe Phe 


Pro Gly 


Ser 


Gly Ala 


Pro Asn 


Glu Val 










805 




810 






815 


Arg 


Phe 


He 


Ser 


Leu Glu Pro 


His Phe 


Tyr 


Leu Tyr 


Leu Ser Gly Asn 








820 




825 






830 




Cys 


He 


Ala 




















835 

















<210> 9 

<211> 1791 

<212> DNA 

<213> Homo sapiens 

<400> 9 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60 
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120 
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180 
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240 
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300 
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360 
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420 
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480 
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540 
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600 
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660 
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720 
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780 
gatgatttcc cccttcgaaa aactgaatcc tcagtcagta gcagttctcc aggctctggt 840 
cccagttcac caaacaatgg gccaactgga agtgttactg aaaatgagac ttcggttttg 900 
ccccctaccc ctcatgccga gcaaatggtt tcacagcaac gcattctaat tcatgaagat 960 
tccatgaacc tgctaagtct ttatacctct ccttctttgc ccaacattac cttggggctt 1020 
cccgcagtgc catcccagct caatgcttcg aattcactca aagaaaagca gaagtgtgag 1080 
acgcagacgc ttaggcaagg tgttcctctg cctgggcagt atggaggcag catcccggca 1140 
tcttccagcc accctcatgt tactttagag ggaaagccac ccaacagcag ccaccaggct 1200 
ctcctgcagc atttattatt gaaagaacaa atgcgacagc aaaagcttct tgtagctggt 1260 
ggagttccct tacatcctca gtctcccttg gcaacaaaag agagaatttc acctggcatt 1320 
agaggtaccc acaaattgcc ccgtcacaga cccctgaacc gaacccagtc tgcacctttg 1380 
cctcagagca cgttggctca gctggtcatt caacagcaac accagcaatt cttggagaag 1440 
cagaagcaat accagcagca gatccacatg aacaaactgc tttcgaaatc tattgaacaa 1500 
ctgaagcaac caggcagtca ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg 1560 
atgcaggaag acagagcgcc ctctagtggc aacagcacta ggagcgacag cagtgcttgt 1620 
gtggatgaca cactgggaca agttggggct gtgaaggtca aggaggaacc agtggacagt 1680 
gatgaagatg ctcagatcca ggaaatggaa tctggggagc aggctgcttt tatgcaacag 1740 
gtaataggca aagatttagc tccaggattt gtaattaaag tcattatctg a 1791 

<210> 10 

<211> 546 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met His Ser Met Ile Ser Ser Val Asp Val Lys Ser Glu Val Pro Val 

1 5 10 15 

Gly Leu Glu Pro Ile Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met 

20 25 30 

Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gln Leu Gln Gln 

35 40 45 

Glu Leu Leu Leu Ile Gln Gln Gln Gln Gln Ile Gln Lys Gln Leu Leu 

50 55 60 

Ile Ala Glu Phe Gln Lys Gln His Glu Asn Leu Thr Arg Gln His Gln 
65 70 75 80 

Ala Gln Leu Gln Glu His Ile Lys Glu Leu Leu Ala Ile Lys Gln Gln 

85 90 95 

Gln Glu Leu Leu Glu Lys Glu Gln Lys Leu Glu Gln Gln Arg Gln Glu 

100 105 110 

Gln Glu Val Glu Arg His Arg Arg Glu Gln Gln Leu Pro Pro Leu Arg 

115 120 125 

Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lys 

130 135 140 

Gln Lys Leu Gln Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr 
145 150 155 160 

Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp 

165 170 175 

Tyr Thr Ala Ala His His Thr Ser Leu Asp Gln Ser Ser Pro Pro Leu 

180 185 190 

Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly Ala Gln Asp 

195 200 205 

Ala Lys Asp Asp Phe Pro Leu Arg Lys Thr Glu Ser Ser Val Ser Ser 

210 215 220 

Ser Ser Pro Gly Ser Gly Pro Ser Ser Pro Asn Asn Gly Pro Thr Gly 
225 230 235 240 

Ser Val Thr Glu Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala 

245 250 255 

Glu Gln Met Val Ser Gln Gln Arg Ile Leu Ile His Glu Asp Ser Met 

260 265 270 

Asn Leu Leu Ser Leu Tyr Thr Ser Pro Ser Leu Pro Asn Ile Thr Leu 

275 280 285 

Gly Leu Pro Ala Val Pro Ser Gln Leu Asn Ala Ser Asn Ser Leu Lys 

290 295 300 

Glu Lys Gln Lys Cys Glu Thr Gln Thr Leu Arg Gln Gly Val Pro Leu 
305 310 315 320 

Pro Gly Gln Tyr Gly Gly Ser Ile Pro Ala Ser Ser Ser His Pro His 

325 330 335 

Val Thr Leu Glu Gly Lys Pro Pro Asn Ser Ser His Gln Ala Leu Leu 

340 345 350 

Gln His Leu Leu Leu Lys Glu Gln Met Arg Gln Gln Lys Leu Leu Val 

355 360 365 

Ala Gly Gly Val Pro Leu His Pro Gln Ser Pro Leu Ala Thr Lys Glu 

370 375 380 

Arg Ile Ser Pro Gly Ile Arg Gly Thr His Lys Leu Pro Arg His Arg 
385 390 395 400 

Pro Leu Asn Arg Thr Gln Ser Ala Pro Leu Pro Gln Ser Thr Leu Ala 
405 








Gin 


Leu 


Val He Gin 


Gin 


Gin 


His 






420 








Gin 


Tyr 


Gin Gin Gin 


He 


His 


Met 






435 






440 


Glu 


Gin 


Leu Lys Gin 


Pro Gly 


Ser 




450 






455 




Leu 


Gin 


Gly Asp Gin 


Ala 


Met 


Gin 


465 






470 






Asn 


Ser 


Thr Arg Ser 


Asp 


Ser 


Ser 






485 








Gin 


Val 


Gly Ala Val 


Lys Val 


Lys 






500 








Asp 


Ala 


Gin He Gin 


Glu 


Met 


Glu 






515 






520 


Gin 


Gin 


Val He Gly 


Lys 


Asp 


Leu 




530 






535 




He 


He 










545 
410 






415 




Gin Gin 


Phe 


Leu 


Glu Lys Gin Lys 


425 






430 




Asn Lys 


Leu 


Leu 


Ser Lys Ser 


He 








445 




His Leu 


Glu 


Glu 


Ala Glu Glu 


Glu 






460 






Glu Asp 


Arg 


Ala 


Pro Ser Ser 


Gly 




475 






480 


Ala Cys 


Val Asp 


Asp Thr Leu Gly 


490 






495 




Glu Glu 


Pro 


Val 


Asp Ser Asp 


Glu 


505 






510 




Ser Gly 


Glu 


Gin 


Ala Ala Phe 


Met 








525 




Ala Pro 


Gly Phe 


Val He Lys 


Val 






540 







<210> 11 

<211> 590 

<212> PRT 

<213> Homo sapiens 



<400> 11 



Met 


His 


Ser 


Met 


He 


Ser 


Ser 


Val 


1 








5 








Gly 


Leu 


Glu 


Pro 


He 


Ser 


Pro 


Leu 








20 










Met 


Met 


Pro 


Val 


Val 


Asp 


Pro 


Val 






35 










40 


Glu 


Leu 


Leu 


Leu 


He 


Gin 


Gin 


Gin 




50 










55 




He 


Ala 


Glu 


Phe 


Gin 


Lys 


Gin 


His 


65 










70 






Ala 


Gin 


Leu 


Gin 


Glu 


His 


He 


Lys 










85 








Gin 


Glu 


Leu 


Leu 


Glu 


Lys 


Glu 


Gin 








100 










Gin 


Glu 


Val 


Glu 


Arg 


His Arg 


Arg 






115 










120 


Gly 


Lys 


Asp 


Arg 


Gly 


Arg 


Glu 


Arg 




130 










135 




Gin 


Lys 


Leu 


Gin 


Glu 


Phe 


Leu 


Leu 


145 










150 






Pro 


Thr 


Asn 


Gly 


Lys 


Asn 


His 


Ser 










165 








Tyr 


Thr 


Ala 


Ala 


His 


His 


Thr 


Ser 








180 










Ser 


Gly 


Thr 


Ser 


Pro 


Ser Tyr 


Lys 






195 










200 


Ala 


Lys 


Asp 


Asp 


Phe 


Pro 


Leu 


Arg 




210 










215 




Lys 


Val 


Arg 


Ser 


Arg 


Leu Lys 


Gin 


225 










230 






Pro 


Leu 


Leu 


Arg 


Arg 


Lys Asp 


Gly 










245 








Arg 


Met 


Phe 


Glu 


Val 


Thr 


Glu 


Ser 








260 










Ser 


Gly 


Pro 


Ser 


Ser 


Pro 


Asn 


Asn 



Asp Val 


Lys 


Ser 


Glu 


Val 


Pro 


Val 




10 










15 




Asp Leu 


Arg 


Thr 


Asp Leu Arg 


Met 


25 










30 






Val 


Arg 


Glu 


Lys 


Gin Leu 


Gin 


Gin 










45 








Gin 


Gin 


He 


Gin 


Lys 


Gin 


Leu 


Leu 








60 










Glu 


Asn 


Leu 


Thr 


Arg 


Gin 


His 


Gin 






75 










80 


Glu 


Leu 


Leu 


Ala 


He 


Lys 


Gin 


Gin 




90 










95 




Lys 


Leu 


Glu 


Gin 


Gin Arg Gin 


Glu 


105 










110 






Glu 


Gin 


Gin 


Leu 


Pro 


Pro 


Leu 


Arg 










125 








Ala 


Val 


Ala 


Ser 


Thr 


Glu 


Val 


Lys 








140 










Ser Lys 


Ser 


Ala 


Thr Lys Asp Thr 






155 










160 


Val 


Ser 


Arg 


His 


Pro 


Lys 


Leu 


Trp 




170 










175 




Leu 


Asp 


Gin 


Ser 


Ser 


Pro 


Pro 


Leu 


185 










190 






Tyr 


Thr 


Leu 


Pro 


Gly Ala 


Gin Asp 










205 








Lys 


Thr 


Ala 


Ser 


Glu 


Pro 


Asn 


Leu 








220 










Lys 


Val 


Ala 


Glu 


Arg 


Arg 


Ser 


Ser 






235 










240 


Asn 


Val 


Val 


Thr 


Ser 


Phe 


Lys 


Lys 




250 










255 




Ser 


Val 


Ser 


Ser 


Ser 


Ser 


Pro 


Gly 


265 










270 






Gly Pro 


Thr Gly 


Ser 


Val 


Thr 


Glu 
275 280 285 

Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala Glu Gln Met Val 

290 295 300 

Ser Gln Gln Arg Ile Leu Ile His Glu Asp Ser Met Asn Leu Leu Ser 
305 310 315 320 

Leu Tyr Thr Ser Pro Ser Leu Pro Asn Ile Thr Leu Gly Leu Pro Ala 

325 330 335 

Val Pro Ser Gln Leu Asn Ala Ser Asn Ser Leu Lys Glu Lys Gln Lys 

340 345 350 

Cys Glu Thr Gln Thr Leu Arg Gln Gly Val Pro Leu Pro Gly Gln Tyr 

355 360 365 

Gly Gly Ser Ile Pro Ala Ser Ser Ser His Pro His Val Thr Leu Glu 

370 375 380 

Gly Lys Pro Pro Asn Ser Ser His Gln Ala Leu Leu Gln His Leu Leu 
385 390 395 400 

Leu Lys Glu Gln Met Arg Gln Gln Lys Leu Leu Val Ala Gly Gly Val 

405 410 415 

Pro Leu His Pro Gln Ser Pro Leu Ala Thr Lys Glu Arg Ile Ser Pro 

420 425 430 

Gly Ile Arg Gly Thr His Lys Leu Pro Arg His Arg Pro Leu Asn Arg 

435 440 445 

Thr Gln Ser Ala Pro Leu Pro Gln Ser Thr Leu Ala Gln Leu Val Ile 

450 455 460 

Gln Gln Gln His Gln Gln Phe Leu Glu Lys Gln Lys Gln Tyr Gln Gln 
465 470 475 480 

Gln Ile His Met Asn Lys Leu Leu Ser Lys Ser Ile Glu Gln Leu Lys 

485 490 495 

Gln Pro Gly Ser His Leu Glu Glu Ala Glu Glu Glu Leu Gln Gly Asp 

500 505 510 

Gln Ala Met Gln Glu Asp Arg Ala Pro Ser Ser Gly Asn Ser Thr Arg 

515 520 525 

Ser Asp Ser Ser Ala Cys Val Asp Asp Thr Leu Gly Gln Val Gly Ala 

530 535 540 

Val Lys Val Lys Glu Glu Pro Val Asp Ser Asp Glu Asp Ala Gln Ile 
545 550 555 560 

Gln Glu Met Glu Ser Gly Glu Gln Ala Ala Phe Met Gln Gln Val Ile 

565 570 575 

Gly Lys Asp Leu Ala Pro Gly Phe Val Ile Lys Val Ile Ile 
580 585 590 

<210> 12 

<211> 1084 

<212> PRT 

<213> Homo sapiens 

<400> 12 

Met Ser Ser Gln Ser His Pro Asp Gly Leu Ser Gly Arg Asp Gln Pro 

1 5 10 15 

Val Glu Leu Leu Asn Pro Ala Arg Val Asn His Met Pro Ser Thr Val 

20 25 30 

Asp Val Ala Thr Ala Leu Pro Leu Gln Val Ala Pro Ser Ala Val Pro 

35 40 45 

Met Asp Leu Arg Leu Asp His Gln Phe Ser Leu Pro Val Ala Glu Pro 

50 55 60 

Ala Leu Arg Glu Gln Gln Leu Gln Gln Glu Leu Leu Ala Leu Lys Gln 
65 70 75 80 

Lys Gln Gln Ile Gln Arg Gln Ile Leu Ile Ala Glu Phe Gln Arg Gln 

85 90 95 

His Glu Gln Leu Ser Arg Gln His Glu Ala Gln Leu His Glu His Ile 

100 105 110 

Lys Gln Gln Gln Glu Met Leu Ala Met Lys His Gln Gln Glu Leu Leu 
115 






120 


Glu 


His 


Gin Arg Lys Leu Glu 


Arg 




130 






135 




Lys 


Gin 


His Arg Glu Gin Lys 


Leu 


145 






150 






Gly 


Lys 


Glu Ser Ala 


Val 


Ala 


Ser 






165 








Glu 


Phe 


Val Leu Asn Lys Lys 


Lys 






180 








His 


Cys 


He Ser Ser Asp 


Pro 


Arg 






195 






200 


Ser 


Ser 


Leu Asp Gin Ser Ser 


Pro 




210 






215 




Tyr 


Asn 


His Pro Val 


Leu Gly 


Met 


225 






230 






Leu 


Arg 


Lys Thr Ala 


Ser Glu 


Pro 






245 








Lys 


Gin 


Lys Val Ala Glu Arg 


Arg 






260 








Asp 


Gly 


Pro Val Val 


Thr Ala 


Leu 






275 






280 


Asp 


Ser 


Ala Cys Ser 


Ser 


Ala 


Pro 




290 






295 




Asn 


Ser 


Ser Gly Ser Val 


Ser 


Ala 


305 






310 






Pro 


Ser 


He Pro Ala 


Glu 


Thr 


Ser 






325 








Glu 


Gly 


Ser Ala Ala 


Pro 


Leu 


Pro 






340 








Asn 


He 


Thr Leu Gly Leu 


Pro 


Ala 






355 






360 


Gly 


Gin 


Gin Asp Thr Glu Arg 


Leu 




370 






375 




Leu 


Ser 


Leu Phe Pro Gly Thr 


His 


385 






390 






Pro 


Leu 


Glu Arg Asp Gly Gly 


Ala 






405 








Met 


Val 


Leu Leu Glu 


Gin 


Pro 


Pro 






420 








Leu 


Gly 


Ala Leu Pro 


Leu 


His 


Ala 






435 






440 


Val 


Ser 


Pro Ser He 


His 


Lys 


Leu 




450 






455 




Thr 


Gin 


Ser Ala Pro 


Leu 


Pro 


Gin 


465 






470 






Val 


He 


Gin Gin Gin 


His 


Gin 


Gin 






485 








Phe 


Gin 


Gin Gin Gin 


Leu 


Gin 


Met 






500 








Glu 


Pro 


Ala Arg Gin 


Pro 


Glu 


Ser 






515 






520 


Leu 


Arg 


Glu His Gin 


Ala 


Leu 


Leu 




530 






535 




Pro 


Gly 


Gin Lys Glu 


Ala 


His 


Ala 


545 






550 






Glu 


Pro 


He Glu Ser 


Asp 


Glu 


Glu 






565 








Glu 


Pro 


Gly Gin Arg 


Gin 


Pro 


Ser 






580 








Gin 


Ala 


Leu Leu Leu 


Glu 


Gin 


Gin 






595 






600 


Gin 


Ala 


Ser Met Glu 


Ala 


Ala 


Gly 
125 



His 


Arg 


Gin 


Glu 


Gin 


Glu 


Leu 


Glu 








140 










Gin 


Gin 


Leu Lys 


Asn 


Lys 


Glu 


Lys 






155 










160 


Thr 


Glu 


Val 


Lys 


Met 


Lys 


Leu 


Gin 




170 










175 




Ala 


Leu 


Ala 


His 


Arg 


Asn 


Leu 


Asn 


185 










190 






Tyr 


Trp 


Tyr Gly 


Lys 


Thr 


Gin 


His 










205 








Pro 


Gin 


Ser Gly 


Val 


Ser 


Thr 


Ser 








220 










Tyr 


Asp 


Ala 


Lys 


Asp 


Asp 


Phe 


Pro 






235 










240 


Asn 


Leu 


Lys 


Leu 


Arg 


Ser 


Arg 


Leu 




250 










255 




Ser 


Ser 


Pro 


Leu 


Leu 


Arg 


Arg 


Lys 


265 










270 






Lys 


Lys 


Arg 


Pro 


Leu 


Asp 


Val 


Thr 










285 








Gly 


Ser 


Gly 


Pro 


Ser 


Ser 


Pro 


Asn 








300 










Glu 


Asn 


Gly 


He 


Ala 


Pro 


Ala 


Val 






315 










320 


Leu 


Ala 


His 


Arg 


Leu 


Val 


Ala 


Arg 




330 










335 




Leu 


Tyr 


Thr 


Ser 


Pro 


Ser 


Leu 


Pro 


345 










350 






Thr 


Gly 


Pro 


Ser 


Ala 


Gly 


Thr 


Ala 










365 








Thr 


Leu 


Pro 


Ala 


Leu 


Gin 


Gin 


Arg 








380 










Leu 


Thr 


Pro 


Tyr 


Leu 


Ser 


Thr 


Ser 






395 










400 


Ala 


His 


Ser 


Pro 


Leu 


Leu 


Gin 


His 




410 










415 




Ala 


Gin 


Ala 


Pro 


Leu 


Val 


Thr 


Gly 


425 










430 






Gin 


Ser 


Leu Val 


Gly 


Ala 


Asp 


Arg 










445 








Arg 


Gin 


His 


Arg 


Pro 


Leu 


Gly 


Arg 








460 










Asn 


Ala 


Gin 


Ala 


Leu 


Gin 


His 


Leu 






475 










480 


Phe 


Leu 


Glu Lys 


His 


Lys 


Gin 


Gin 




490 










495 




Asn 


Lys 


He 


He 


Pro 


Lys 


Pro 


Ser 


505 










510 






His 


Pro 


Glu 


Glu 


Thr 


Glu 


Glu 


Glu 










525 








Asp 


Glu 


Pro 


Tyr 


Leu 


Asp 


Arg 


Leu 








540 










Gin 


Ala 


Gly Val 


Gin 


Val 


Lys 


Gin 






555 










560 


Glu 


Ala 


Glu 


Pro 


Pro 


Arg 


Glu 


Val 




570 










575 




Glu 


Gin 


Glu 


Leu 


Leu 


Phe 


Arg 


Gin 


585 










590 






Arg 


He 


His 


Gin 


Leu 


Arg 


Asn 


Tyr 










605 








He 


Pro 


Val 


Ser 


Phe 


Gly 


Gly 


His 
610 615 620 

Arg Pro Leu Ser Arg Ala Gln Ser Ser Pro Ala Ser Ala Thr Phe Pro 
625 630 635 640 

Val Ser Val Gln Glu Pro Pro Thr Lys Pro Arg Phe Thr Thr Gly Leu 

645 650 655 

Val Tyr Asp Thr Leu Met Leu Lys His Gln Cys Thr Cys Gly Ser Ser 

660 665 670 

Ser Ser His Pro Glu His Ala Gly Arg Ile Gln Ser Ile Trp Ser Arg 

675 680 685 

Leu Gln Glu Thr Gly Leu Arg Gly Lys Cys Glu Cys Ile Arg Gly Arg 

690 695 700 

Lys Ala Thr Leu Glu Glu Leu Gln Thr Val His Ser Glu Ala His Thr 
705 710 715 720 

Leu Leu Tyr Gly Thr Asn Pro Leu Asn Arg Gln Lys Leu Asp Ser Lys 

725 730 735 

Lys Leu Leu Gly Ser Leu Ala Ser Val Phe Val Arg Leu Pro Cys Gly 

740 745 750 

Gly Val Gly Val Asp Ser Asp Thr Ile Trp Asn Glu Val His Ser Ala 

755 760 765 

Gly Ala Ala Arg Leu Ala Val Gly Cys Val Val Glu Leu Val Phe Lys 

770 775 780 

Val Ala Thr Gly Glu Leu Lys Asn Gly Phe Ala Val Val Arg Pro Pro 

785 790 795 800 

Gly His His Ala Glu Glu Ser Thr Pro Met Gly Phe Cys Tyr Phe Asn 

805 810 815 

Ser Val Ala Val Ala Ala Lys Leu Leu Gln Gln Arg Leu Ser Val Ser 

820 825 830 

Lys Ile Leu Ile Val Asp Trp Asp Val His His Gly Asn Gly Thr Gln 

835 840 845 

Gln Ala Phe Tyr Ser Asp Pro Ser Val Leu Tyr Met Ser Leu His Arg 

850 855 860 

Tyr Asp Asp Gly Asn Phe Phe Pro Gly Ser Gly Ala Pro Asp Glu Val 
865 870 875 880 

Gly Thr Gly Pro Gly Val Gly Phe Asn Val Asn Met Ala Phe Thr Gly 

885 890 895 

Gly Leu Asp Pro Pro Met Gly Asp Ala Glu Tyr Leu Ala Ala Phe Arg 

900 905 910 

Thr Val Val Met Pro Ile Ala Ser Glu Phe Ala Pro Asp Val Val Leu 

915 920 925 

Val Ser Ser Gly Phe Asp Ala Val Glu Gly His Pro Thr Pro Leu Gly 
930 935 940 

Gly Tyr Asn Leu Ser Ala Arg Cys Phe Gly Tyr Leu Thr Lys Gln Leu 

945 950 955 960 

Met Gly Leu Ala Gly Gly Arg Ile Val Leu Ala Leu Glu Gly Gly His 

965 970 975 

Asp Leu Thr Ala Ile Cys Asp Ala Ser Glu Ala Cys Val Ser Ala Leu 

980 985 990 

Leu Gly Asn Glu Leu Asp Pro Leu Pro Glu Lys Val Leu Gln Gln Arg 

995 1000 1005 

Pro Asn Ala Asn Ala Val Arg Ser Met Glu Lys Val Met Glu Ile His 

1010 1015 1020 

Ser Lys Tyr Trp Arg Cys Leu Gln Arg Thr Thr Ser Thr Ala Gly Arg 
1025 1030 1035 1040 

Ser Leu Ile Glu Ala Gln Thr Cys Glu Asn Glu Glu Ala Glu Thr Val 

1045 1050 1055 

Thr Ala Met Ala Ser Leu Ser Val Gly Val Lys Pro Ala Glu Lys Arg 

1060 1065 1070 

Pro Asp Glu Glu Pro Met Glu Glu Glu Pro Pro Leu 
1075 1080 



<210> 13 
<211> 3550 

<212> DNA 

<213> Homo sapiens 

<400> 13 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60 
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120 
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180 
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240 
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300 
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360 
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420 
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480 
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540 
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600 
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660 
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720 
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780 
gatgatttcc cccttcgaaa aactgcctct gagcccaact tgaaggtgcg gtccaggtta 840 
aaacagaaag tggcagagag gagaagcagc cccttactca ggcggaagga tggaaatgtt 900 
gtcacttcat tcaagaagcg aatgtttgag gtgacagaat cctcagtcag tagcagttct 960 
ccaggctctg gtcccagttc accaaacaat gggccaactg gaagtgttac tgaaaatgag 1020 
acttcggttt tgccccctac ccctcatgcc gagcaaatgg tttcacagca acgcattcta 1080 
attcatgaag attccatgaa cctgctaagt ctttatacct ctccttcttt gcccaacatt 1140 
accttggggc ttcccgcagt gccatcccag ctcaatgctt cgaattcact caaagaaaag 1200 
cagaagtgtg agacgcagac gcttaggcaa ggtgttcctc tgcctgggca gtatggaggc 1260 
agcatcccgg catcttccag ccaccctcat gttactttag agggaaagcc acccaacagc 1320 
agccaccagg ctctcctgca gcatttatta ttgaaagaac aaatgcgaca gcaaaagctt 1380 
cttgtagctg gtggagttcc cttacatcct cagtctccct tggcaacaaa agagagaatt 1440 
tcacctggca ttagaggtac ccacaaattg ccccgtcaca gacccctgaa ccgaacccag 1500 
tctgcacctt tgcctcagag cacgttggct cagctggtca ttcaacagca acaccagcaa 1560 
ttcttggaga agcagaagca ataccagcag cagatccaca tgaacaaact gctttcgaaa 1620 
tctattgaac aactgaagca accaggcagt caccttgagg aagcagagga agagcttcag 1680 
ggggaccagg cgatgcagga agacagagcg ccctctagtg gcaacagcac taggagcgac 1740 
agcagtgctt gtgtggatga cacactggga caagttgggg ctgtgaaggt caaggaggaa 1800 
ccagtggaca gtgatgaaga tgctcagatc caggaaatgg aatctgggga gcaggctgct 1860 
tttatgcaac aggtaatagg caaagattta gctccaggat ttgtaattaa agtcattatc 1920 
tgacctttcc tggaacccac gcacacacgt gcgctctctg tgcgccaagc tccgctggct 1980 
gcggttggca tggatggatt agagaaacac cgtctcgtct ccaggactca ctcttcccct 2040 
gctgcctctg ttttacctca cccagcaatg gaccgccccc tccagcctgg ctctgcaact 2100 
ggaattgcct atgacccctt gatgctgaaa caccagtgcg tttgtggcaa ttccaccacc 2160 
caccctgagc atgctggacg aatacagagt atctggtcac gactgcaaga aactgggctg 2220 
ctaaataaat gtgagcgaat tcaaggtcga aaagccagcc tggaggaaat acagcttgtt 2280 
cattctgaac atcactcact gttgtatggc accaaccccc tggacggaca gaagctggac 2340 
cccaggatac tcctaggtga tgactctcaa aagttttttt cctcattacc ttgtggtgga 2400 
cttggggtgg acagtgacac catttggaat gagctacact cgtccggtgc tgcacgcatg 2460 
gctgttggct gtgtcatcga gctggcttcc aaagtggcct caggagagct gaagaatggg 2520 
tttgctgttg tgaggccccc tggccatcac gctgaagaat ccacagccat ggggttctgc 2580 
ttttttaatt cagttgcaat taccgccaaa tacttgagag accaactaaa tataagcaag 2640 
atattgattg tagatctgga tgttcaccat ggaaacggta cccagcaggc cttttatgct 2700 
gaccccagca tcctgtacat ttcactccat cgctatgatg aagggaactt tttccctggc 2760 
agtggagccc caaatgaggt tcggtttatt tctttagagc cccactttta tttgtatctt 2820 
tcaggtaatt gcattgcatg attaccccta attttcttgt cctttgctgg tgttttaaat 2880 
tacacgagat tactgaattg tcccatggga ccaagaacca gtgcagaaca agtgcataac 2940 
ccagagcact gtttgtcagg gaaggttggg ctgatttgat gtgttgtttg atgtttattt 3000 
caagagctcc catgtgcttg ttttcctctc ttcttgcttt cttccatttg ctctcttctc 3060 
tgcccaccgt ggtgtgtctt tctcttccca ggttggaaca ggccttggag aagggtacaa 3120 
tataaatatt gcctggacag gtggccttga tcctcccatg ggagatgttg agtaccttga 3180 
agcattcagg accatcgtga agcctgtggc caaagagttt gatccagaca tggtcttagt 3240 
atctgctgga tttgatgcat tggaaggcca cacccctcct ctaggagggt acaaagtgac 3300 
ggcaaaatgt tttggtcatt tgacgaagca attgatgaca ttggctgatg gacgtgtggt 3360 
gttggctcta gaaggaggac atgatctcac agccatctgt gatgcatcag aagcctgtgt 3420 
aaatgccctt ctaggaaatg agctggagcc acttgcagaa gatattctcc accaaagccc 3480 
gaatatgaat gctgttattt ctttacagaa gatcattgaa attcaaagta tgtctttaaa 3540 
gttctcttaa 3550 

<210> 14 

<211> 7699 , 

<212> DNA 

<213> Homo sapiens 

<400> 14 

cccattcgcc attcaggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc 60 
tattacgcca gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccca 120 
gggttttccc agtcacgacg ttgtaaaacg acggccagtg ccaagctgat ctaatcaata 180 
ttggccatta gccatattat tcattggtta tatagcataa atcaatattg gctattggcc 240 
attgcatacg ttgtatccat atcataatat gtacatttat attggctcat gtccaacatt 300 
accgccatgt tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt 360 
agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg 420 
cgaccgccca gcgacccccg cccgttgacg tcaatagtga cgtatgttcc catagtaacg 480 
ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg 540 
gcagtacatc aagtgtatca tatgccaagt ccgcccccta ttgacgtcaa tgacggtaaa 600 
tggcccgcct agcattatgc ccagtacatg accttacggg agtttcctac ttggcagtac 660 
atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta caccaatggg 720 
cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg 780 
agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaataa ccccgccccg 840 
ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcgttta 900 
gtgaaccgtc agaattcaag cttgcggccg cagatctatc gatctgcagg atatcaccat 960 
gcacagtatg atcagctcag tggatgtgaa gtcagaagtt cctgtgggcc tggagcccat 1020 
ctcaccttta gacctaagga cagacctcag gatgatgatg cccgtggtgg accctgttgt 1080 
ccgtgagaag caattgcagc aggaattact tcttatccag cagcagcaac aaatccagaa 1140 
gcagcttctg atagcagagt ttcagaaaca gcatgagaac ttgacacggc agcaccaggc 1200 
tcagcttcag gagcatatca aggaacttct agccataaaa cagcaacaag aactcctaga 1260 
aaaggagcag aaactggagc agcagaggca agaacaggaa gtagagaggc atcgcagaga 1320 
acagcagctt cctcctctca gaggcaaaga tagaggacga gaaagggcag tggcaagtac 1380 
agaagtaaag cagaagcttc aagagttcct actgagtaaa tcagcaacga aagacactcc 1440 
aactaatgga aaaaatcatt ccgtgagccg ccatcccaag ctctggtaca cggctgccca 1500 
ccacacatca ttggatcaaa gctctccacc ccttagtgga acatctccat cctacaagta 1560 
cacattacca ggagcacaag atgcaaagga tgatttcccc cttcgaaaaa ctgcctctga 1620 
gcccaacttg aaggtgcggt ccaggttaaa acagaaagtg gcagagagga gaagcagccc 1680 
cttactcagg cggaaggatg gaaatgttgt cacttcattc aagaagcgaa tgtttgaggt 1740 
gacagaatcc tcagtcagta gcagttctcc aggctctggt cccagttcac caaacaatgg 1800 
gccaactgga agtgttactg aaaatgagac ttcggttttg ccccctaccc ctcatgccga 1860 
gcaaatggtt tcacagcaac gcattctaat tcatgaagat tccatgaacc tgctaagtct 1920 
ttatacctct ccttctttgc ccaacattac cttggggctt cccgcagtgc catcccagct 1980 
caatgcttcg aattcactca aagaaaagca gaagtgtgag acgcagacgc ttaggcaagg 2040 
tgttcctctg cctgggcagt atggaggcag catcccggca tcttccagcc accctcatgt 2100 
tactttagag ggaaagccac ccaacagcag ccaccaggct ctcctgcagc atttattatt 2160 
gaaagaacaa atgcgacagc aaaagcttct tgtagctggt ggagttccct tacatcctca 2220 
gtctcccttg gcaacaaaag agagaatttc acctggcatt agaggtaccc acaaattgcc 2280 
ccgtcacaga cccctgaacc gaacccagtc tgcacctttg cctcagagca cgttggctca 2340 
gctggtcatt caacagcaac accagcaatt cttggagaag cagaagcaat accagcagca 2400 
gatccacatg aacaaactgc tttcgaaatc tattgaacaa ctgaagcaac caggcagtca 2460 
ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg atgcaggaag acagagcgcc 2520 
ctctagtggc aacagcacta ggagcgacag cagtgcttgt gtggatgaca cactgggaca 2580 
agttggggct gtgaaggtca aggaggaacc agtggacagt gatgaagatg ctcagatcca 2640 
ggaaatggaa tctggggagc aggctgcttt tatgcaacag cctttcctgg aacccacgca 2700 
cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg gttggcatgg atggattaga 2760 
gaaacaccgt ctcgtctcca ggactcactc ttcccctgct gcctctgttt tacctcaccc 2820 
agcaatggac cgccccctcc agcctggctc tgcaactgga attgcctatg accccttgat 2880 
gctgaaacac cagtgcgttt gtggcaattc caccacccac cctgagcatg ctggacgaat 2940 
acagagtatc tggtcacgac tgcaagaaac tgggctgcta aataaatgtg agcgaattca 3000 
aggtcgaaaa gccagcctgg aggaaataca gcttgttcat tctgaacatc actcactgtt 3060 
gtatggcacc aaccccctgg acggacagaa gctggacccc aggatactcc taggtgatga 3120 
ctctcaaaag tttttttcct cattaccttg tggtggactt ggggtggaca gtgacaccat 3180 
ttggaatgag ctacactcgt ccggtgctgc acgcatggct gttggctgtg tcatcgagct 3240 
ggcttccaaa gtggcctcag gagagctgaa gaatgggttt gctgttgtga ggccccctgg 3300 
ccatcacgct gaagaatcca cagccatggg gttctgcttt tttaattcag ttgcaattac 3360 
cgccaaatac ttgagagacc aactaaatat aagcaagata ttgattgtag atctggatgt 3420 
tcaccatgga aacggtaccc agcaggcctt ttatgctgac cccagcatcc tgtacatttc 3480 
actccatcgc tatgatgaag ggaacttttt ccctggcagt ggagccccaa atgaggttgg 3540 
aacaggcctt ggagaagggt acaatataaa tattgcctgg acaggtggcc ttgatcctcc 3600 
catgggagat gttgagtacc ttgaagcatt caggaccatc gtgaagcctg tggccaaaga 3660 
gtttgatcca gacatggtct tagtatctgc tggatttgat gcattggaag gccacacccc 3720 
tcctctagga gggtacaaag tgacggcaaa atgttttggt catttgacga agcaattgat 3780 
gacattggct gatggacgtg tggtgttggc tctagaagga ggacatgatc tcacagccat 3840 
ctgtgatgca tcagaagcct gtgtaaatgc ccttctagga aatgagctgg agccacttgc 3900 
agaagatatt ctccaccaaa gcccgaatat gaatgctgtt atttctttac agaagatcat 3960 
tgaaattcaa agtatgtctt taaagttctc tggatccggt accagattac aaggacgacg 4020 
atgacaagta gatcccgggt ggcatccctg tgacccctcc ccagtgcctc tcctggcctt 4080 
ggaagttgcc actccagtgc ccaccagcct tgtcctaata aaattaagtt gcatcatttt 4140 
gtctgactag gtgtcctcta taatattatg gggtggaggg gggtggtatg gagcaagggg 4200 
cccaagttgg gaagacaacc tgtagggcct gcggggtcta ttcgggaacc aagctggagt 4260 
gcagtggcac aatcttggct cactgcaatc tccgcctcct gggttcaagc gattctcctg 4320 
cctcagcctc ccgagttgtt gggattccag gcatgcatga ccaggctcag ctaatttttg 4380 
tttttttggt agagacgggg tttcaccata ttggccaggc tggtctccaa ctcctaatct 4440 
caggtgatct acccaccttg gcctcccaaa ttgctgggat tacaggcgtg aaccactgct 4500 
cccttccctg tccttctgat tttaaaataa ctataccagc aggaggacgt ccagacacag 4560 
cataggctac ctgccatggc ccaaccggtg ggacatttga gttgcttgct tggcactgtc 4620 
ctctcatgcg ttgggtccac tcagtagatg cctgttgaat tgggtacgcg gccagcttct 4680 
gtggaatgtg tgtcagttag ggtgtggaaa gtccccaggc tccccagcag gcagaagtat 4740 
gcaaagcatg catctcaatt agtcagcaac caggtgtgga aaagtcccca ggctccccag 4800 
caggcagaag tatgcaaagc atgcatctca attagtcagc aaccatagtc ccgcccctaa 4860 
ctccgcccat cccgccccta actccgccca gttccgccca ttctccgccc catggctgac 4920 
taattttttt tatttatgca gaggccgagg ccgcctcggc ctctgagcta ttccagaagt 4980 
agtgaggagg cttttttgga ggcctaggct tttgcaaaaa gctcctcgag gaactgaaaa 5040 
accagaaagt taattcccta tagtgagtcg tattaaattc gtaatcatgg tcatagctgt 5100 
ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa 5160 
agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac 5220 
tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg 5280 
cggggagagg cggtttgcgt attgggcgct cttccgcttc ctcgctcact gactcgctgc 5340 
gctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat 5400 
ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca 5460 
ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc 5520 
atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc 5580 
aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg 5640 
gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcaatgc tcacgctgta 5700 
ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg 5760 
ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac 5820 
acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag 5880 
gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga agaacagtat 5940 
ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat 6000 
ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc 6060 
gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt 6120 
ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct 6180 
agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaaactt 6240 
ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc tgtctatttc 6300 
gttcatccat agttgcctga ctccccgtcg tgtagataac tacgatacgg gagggcttac 6360 
catctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct ccagatttat 6420 
cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca actttatccg 6480 
cctccatcca gtctattaat tgttgccggg aagctagagt aagtagttcg ccagttaata 6540 
gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg tcgtttggta 6600 
tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc cccatgttgt 6660 
gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag 6720 
tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg ccatccgtaa 6780 
gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc 6840 
gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat agcagaactt 6900 
taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg atcttaccgc 6960 
tgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcttca gcatctttta 7020 
ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca aaaaagggaa 7080 
taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat tattgaagca 7140 
tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag aaaaataaac 7200 
aaataggggt tccgcgcaca tttccccgaa aagtgccacc tgacgcgccc tgtagcggcg 7260 
cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt gccagcgccc 7320 
tagcgcccgc tcctttcgct ttcttccctt cctttctcgc cacgttcgcc ggctttcccc 7380 
gtcaagctct aaatcggggc atccctttag ggttccgatt tagtgcttta cggcacctcg 7440 
accccaaaaa acttgattag ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg 7500 
tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg ttccaaactg 7560 
gaacaacact caaccctatc tcggtctatt cttttgattt ataagggatt ttgccgattt 7620 
cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat tttaacaaaa 7680 
tattaaacgt ttacaattt 7699 



<210> 15 

<211> 7303 

<212> DNA 

<213> Homo sapiens 

<400> 15 

cccattcgcc attcaggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc 60 
tattacgcca gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccca 120 
gggttttccc agtcacgacg ttgtaaaacg acggccagtg ccaagctgat ctaatcaata 180 
ttggccatta gccatattat tcattggtta tatagcataa atcaatattg gctattggcc 240 
attgcatacg ttgtatccat atcataatat gtacatttat attggctcat gtccaacatt 300 
accgccatgt tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt 360 
agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg 420 
cgaccgccca gcgacccccg cccgttgacg tcaatagtga cgtatgttcc catagtaacg 480 
ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg 540 
gcagtacatc aagtgtatca tatgccaagt ccgcccccta ttgacgtcaa tgacggtaaa 600 
tggcccgcct agcattatgc ccagtacatg accttacggg agtttcctac ttggcagtac 660 
atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta caccaatggg 720 
cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg 780 
agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaataa ccccgccccg 840 
ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcgttta 900 
gtgaaccgtc agaattcaag cttgcggccg cagatctatc gatctgcagg atatcaccat 960 
gcacagtatg atcagctcag tggatgtgaa gtcagaagtt cctgtgggcc tggagcccat 1020 
ctcaccttta gacctaagga cagacctcag gatgatgatg cccgtggtgg accctgttgt 1080 
ccgtgagaag caattgcagc aggaattact tcttatccag cagcagcaac aaatccagaa 1140 
gcagcttctg atagcagagt ttcagaaaca gcatgagaac ttgacacggc agcaccaggc 1200 
tcagcttcag gagcatatca aggaacttct agccataaaa cagcaacaag aactcctaga 1260 
aaaggagcag aaactggagc agcagaggca agaacaggaa gtagagaggc atcgcagaga 1320 
acagcagctt cctcctctca gaggcaaaga tagaggacga gaaagggcag tggcaagtac 1380 
agaagtaaag cagaagcttc aagagttcct actgagtaaa tcagcaacga aagacactcc 1440 
aactaatgga aaaaatcatt ccgtgagccg ccatcccaag ctctggtaca cggctgccca 1500 
ccacacatca ttggatcaaa gctctccacc ccttagtgga acatctccat cctacaagta 1560 
cacattacca ggagcacaag atgcaaagga tgatttcccc cttcgaaaaa ctgcctctga 1620 
gcccaacttg aaggtgcggt ccaggttaaa acagaaagtg gcagagagga gaagcagccc 1680 
cttactcagg cggaaggatg gaaatgttgt cacttcattc aagaagcgaa tgtttgaggt 1740 
gacagaatcc tcagtcagta gcagttctcc aggctctggt cccagttcac caaacaatgg 1800 
gccaactgga agtgttactg aaaatgagac ttcggttttg ccccctaccc ctcatgccga 1860 
gcaaatggtt tcacagcaac gcattctaat tcatgaagat tccatgaacc tgctaagtct 1920 
ttatacctct ccttctttgc ccaacattac cttggggctt cccgcagtgc catcccagct 1980 
caatgcttcg aattcactca aagaaaagca gaagtgtgag acgcagacgc ttaggcaagg 2040 
tgttcctctg cctgggcagt atggaggcag catcccggca tcttccagcc accctcatgt 2100 
tactttagag ggaaagccac ccaacagcag ccaccaggct ctcctgcagc atttattatt 2160 
gaaagaacaa atgcgacagc aaaagcttct tgtagctggt ggagttccct tacatcctca 2220 
gtctcccttg gcaacaaaag agagaatttc acctggcatt agaggtaccc acaaattgcc 2280 
ccgtcacaga cccctgaacc gaacccagtc tgcacctttg cctcagagca cgttggctca 2340 
gctggtcatt caacagcaac accagcaatt cttggagaag cagaagcaat accagcagca 2400 
gatccacatg aacaaactgc tttcgaaatc tattgaacaa ctgaagcaac caggcagtca 2460 
ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg atgcaggaag acagagcgcc 2520 
ctctagtggc aacagcacta ggagcgacag cagtgcttgt gtggatgaca cactgggaca 2580 
agttggggct gtgaaggtca aggaggaacc agtggacagt gatgaagatg ctcagatcca 2640 
ggaaatggaa tctggggagc aggctgcttt tatgcaacag cctttcctgg aacccacgca 2700 
cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg gttggcatgg atggattaga 2760 
gaaacaccgt ctcgtctcca ggactcactc ttcccctgct gcctctgttt tacctcaccc 2820 
agcaatggac cgccccctcc agcctggctc tgcaactgga attgcctatg accccttgat 2880 
gctgaaacac cagtgcgttt gtggcaattc caccacccac cctgagcatg ctggacgaat 2940 
acagagtatc tggtcacgac tgcaagaaac tgggctgcta aataaatgtg agcgaattca 3000 
aggtcgaaaa gccagcctgg aggaaataca gcttgttcat tctgaacatc actcactgtt 3060 
gtatggcacc aaccccctgg acggacagaa gctggacccc aggatactcc taggtgatga 3120 
ctctcaaaag tttttttcct cattaccttg tggtggactt ggggtggaca gtgacaccat 3180 
ttggaatgag ctacactcgt ccggtgctgc acgcatggct gttggctgtg tcatcgagct 3240 
ggcttccaaa gtggcctcag gagagctgaa gaatgggttt gctgttgtga ggccccctgg 3300 
ccatcacgct gaagaatcca cagccatggg gttctgcttt tttaattcag ttgcaattac 3360 
cgccaaatac ttgagagacc aactaaatat aagcaagata ttgattgtag atctggatgt 3420 
tcaccatgga aacggtaccc agcaggcctt ttatgctgac cccagcatcc tgtacatttc 3480 
actccatcgc tatgatgaag ggaacttttt ccctggcagt ggagccccaa atgaggttcg 3540 
gtttatttct ttagagcccc acttttattt gtatctttca ggtaattgca ttgcaggatc 3600 
cggtaccaga ttacaaggac gacgatgaca agtagatccc gggtggcatc cctgtgaccc 3660 
ctccccagtg cctctcctgg ccttggaagt tgccactcca gtgcccacca gccttgtcct 3720 
aataaaatta agttgcatca ttttgtctga ctaggtgtcc tctataatat tatggggtgg 3780 
aggggggtgg tatggagcaa ggggcccaag ttgggaagac aacctgtagg gcctgcgggg 3840 
tctattcggg aaccaagctg gagtgcagtg gcacaatctt ggctcactgc aatctccgcc 3900 
tcctgggttc aagcgattct cctgcctcag cctcccgagt tgttgggatt ccaggcatgc 3960 
atgaccaggc tcagctaatt tttgtttttt tggtagagac ggggtttcac catattggcc 4020 
aggctggtct ccaactccta atctcaggtg atctacccac cttggcctcc caaattgctg 4080 
ggattacagg cgtgaaccac tgctcccttc cctgtccttc tgattttaaa ataactatac 4140 
cagcaggagg acgtccagac acagcatagg ctacctgcca tggcccaacc ggtgggacat 4200 
ttgagttgct tgcttggcac tgtcctctca tgcgttgggt ccactcagta gatgcctgtt 4260 
gaattgggta cgcggccagc ttctgtggaa tgtgtgtcag ttagggtgtg gaaagtcccc 4320 
aggctcccca gcaggcagaa gtatgcaaag catgcatctc aattagtcag caaccaggtg 4380 
tggaaaagtc cccaggctcc ccagcaggca gaagtatgca aagcatgcat ctcaattagt 4440 
cagcaaccat agtcccgccc ctaactccgc ccatcccgcc cctaactccg cccagttccg 4500 
cccattctcc gccccatggc tgactaattt tttttattta tgcagaggcc gaggccgcct 4560 
cggcctctga gctattccag aagtagtgag gaggcttttt tggaggccta ggcttttgca 4620 
aaaagctcct cgaggaactg aaaaaccaga aagttaattc cctatagtga gtcgtattaa 4680 
attcgtaatc atggtcatag ctgtttcctg tgtgaaattg ttatccgctc acaattccac 4740 
acaacatacg agccggaagc ataaagtgta aagcctgggg tgcctaatga gtgagctaac 4800 
tcacattaat tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg tcgtgccagc 4860 
tgcattaatg aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg 4920 
cttcctcgct cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc 4980 
actcaaaggc ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt 5040 
gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc 5100 
ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa 5160 
acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc 5220 
ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg 5280 
cgctttctca atgctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc 5340 
tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc 5400 
gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca 5460 
ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact 5520 
acggctacac tagaagaaca gtatttggta tctgcgctct gctgaagcca gttaccttcg 5580 
gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt 5640 
ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct 5700 
tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga 5760 
gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa 5820 
tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac 5880 
ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga 5940 
taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata ccgcgagacc 6000 
cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg gccgagcgca 6060 
gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc cgggaagcta 6120 
gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct acaggcatcg 6180 
tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa cgatcaaggc 6240 
gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg 6300 
ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca ctgcataatt 6360 
ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac tcaaccaagt 6420 
cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca atacgggata 6480 
ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt tcttcggggc 6540 
gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc actcgtgcac 6600 
ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa 6660 
ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata ctcatactct 6720 
tcctttttca atattattga agcatttatc agggttattg tctcatgagc ggatacatat 6780 
ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc 6840 
cacctgacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg 6900 
tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc 6960 
tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg gggcatccct ttagggttcc 7020 
gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacgta 7080 
gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta 7140 
atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc tattcttttg 7200 
atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg atttaacaaa 7260 
aatttaacgc gaattttaac aaaatattaa acgtttacaa ttt 7303 



<210> 16 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 16 

ccatggaaac ggtacccagc aggc 

<210> 17 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 17 

cactccatcg ctatgatgaa ggg 

<210> 18 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 18 

agttcccttc atcatagcga tgg 

<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 19 

aatgtacagg atgctggggt 

<210> 20 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 20 

cccttgtagc tggtggagtt ccctt 

<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 21 

tgtgtcatcg agctggcttc 

<210> 22 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 22 

atcttctgca agtggctcca 






Thr His Ser Ser Pro Ala Ala Ser Val Leu Pro His Pro Ala Met Asp 
610 615 620 

Arg Pro Leu Gln Pro Gly Ser Ala Thr Gly Ile Ala Tyr Asp Pro Leu 
625 630 635 640 

Met Leu Lys His Gln Cys Val Cys Gly Asn Ser Thr Thr His Pro Glu 
645 650 655 

His Ala Gly Arg Ile Gln Ser Ile Trp Ser Arg Leu Gln Glu Thr Gly 
660 665 670 

Leu Leu Asn Lys Cys Glu Arg Ile Gln Gly Arg Lys Ala Ser Leu Glu 
675 680 685 

Glu Ile Gln Leu Val His Ser Glu His His Ser Leu Leu Tyr Gly Thr 
690 695 700 

Asn Pro Leu Asp Gly Gln Lys Leu Asp Pro Arg Ile Leu Leu Gly Asp 
705 710 715 720 

Asp Ser Gln Lys Phe Phe Ser Ser Leu Pro Cys Gly Gly Leu Gly Val 
725 730 735 

Asp Ser Asp Thr Ile Trp Asn Glu Leu His Ser Ser Gly Ala Ala Arg 
740 745 750 

Met Ala Val Gly Cys Val Ile Glu Leu Ala Ser Lys Val Ala Ser Gly 
755 760 765 

Glu Leu Lys Asn Gly Phe Ala Val Val Arg Pro Pro Gly His His Ala 
770 775 780 

Glu Glu Ser Thr Ala Met Gly Phe Cys Phe Phe Asn Ser Val Ala Ile 
785 790 795 800 

Thr Ala Lys Tyr Leu Arg Asp Gln Leu Asn Ile Ser Lys Ile Leu Ile 
805 810 815 

Val Asp Leu Asp Val His His Gly Asn Gly Thr Gln Gln Ala Phe Tyr 
820 825 830 

Ala Asp Pro Ser Ile Leu Tyr Ile Ser Leu His Arg Tyr Asp Glu Gly 
835 840 845 

Asn Phe Phe Pro Gly Ser Gly Ala Pro Asn Glu Val Arg Phe Ile Ser 
850 855 860 

Leu Glu Pro His Phe Tyr Leu Tyr Leu Ser Gly Asn Cys Ile Ala 
865 870 875 



<210> 5 

<211> 3054 

<212> DNA 

<213> Homo sapiens 



<400> 5 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60 
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120 
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180 
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240 
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300 
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360 
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420 
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480 
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540 
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600 
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660 
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720 
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780 
gatgatttcc cccttcgaaa aactgaatcc tcagtcagta gcagttctcc aggctctggt 840 
cccagttcac caaacaatgg gccaactgga agtgttactg aaaatgagac ttcggttttg 900 
ccccctaccc ctcatgccga gcaaatggtt tcacagcaac gcattctaat tcatgaagat 960 
tccatgaacc tgctaagtct ttatacctct ccttctttgc ccaacattac cttggggctt 1020 
cccgcagtgc catcccagct caatgcttcg aattcactca aagaaaagca gaagtgtgag 1080 
acgcagacgc ttaggcaagg tgttcctctg cctgggcagt atggaggcag catcccggca 1140 
tcttccagcc accctcatgt tactttagag ggaaagccac ccaacagcag ccaccaggct 1200 
ctcctgcagc atttattatt gaaagaacaa atgcgacagc aaaagcttct tgtagctggt 1260 
ggagttccct tacatcctca gtctcccttg gcaacaaaag agagaatttc acctggcatt 1320 
agaggtaccc acaaattgcc ccgtcacaga cccctgaacc gaacccagtc tgcacctttg 1380 
cctcagagca cgttggctca gctggtcatt caacagcaac accagcaatt cttggagaag 1440 
cagaagcaat accagcagca gatccacatg aacaaactgc tttcgaaatc tattgaacaa 1500 
ctgaagcaac caggcagtca ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg 1560 
atgcaggaag acagagcgcc ctctagtggc aacagcacta ggagcgacag cagtgcttgt 1620 
gtggatgaca cactgggaca agttggggct gtgaaggtca aggaggaacc agtggacagt 1680 
gatgaagatg ctcagatcca ggaaatggaa tctggggagc aggctgcttt tatgcaacag 1740 
cctttcctgg aacccacgca cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg 1800 
gttggcatgg atggattaga gaaacaccgt ctcgtctcca ggactcactc ttcccctgct 1860 
gcctctgttt tacctcaccc agcaatggac cgccccctcc agcctggctc tgcaactgga 1920 
attgcctatg accccttgat gctgaaacac cagtgcgttt gtggcaattc caccacccac 1980 
cctgagcatg ctggacgaat acagagtatc tggtcacgac tgcaagaaac tgggctgcta 2040 
aataaatgtg agcgaattca aggtcgaaaa gccagcctgg aggaaataca gcttgttcat 2100 
tctgaacatc actcactgtt gtatggcacc aaccccctgg acggacagaa gctggacccc 2160 
aggatactcc taggtgatga ctctcaaaag tttttttcct cattaccttg tggtggactt 2220 
ggggtggaca gtgacaccat ttggaatgag ctacactcgt ccggtgctgc acgcatggct 2280 
gttggctgtg tcatcgagct ggcttccaaa gtggcctcag gagagctgaa gaatgggttt 2340 
gctgttgtga ggccccctgg ccatcacgct gaagaatcca cagccatggg gttctgcttt 2400 
tttaattcag ttgcaattac cgccaaatac ttgagagacc aactaaatat aagcaagata 2460 
ttgattgtag atctggatgt tcaccatgga aacggtaccc agcaggcctt ttatgctgac 2520 
cccagcatcc tgtacatttc actccatcgc tatgatgaag ggaacttttt ccctggcagt 2580 
ggagccccaa atgaggttgg aacaggcctt ggagaagggt acaatataaa tattgcctgg 2640 
acaggtggcc ttgatcctcc catgggagat gttgagtacc ttgaagcatt caggaccatc 2700 
gtgaagcctg tggccaaaga gtttgatcca gacatggtct tagtatctgc tggatttgat 2760 
gcattggaag gccacacccc tcctctagga gggtacaaag tgacggcaaa atgttttggt 2820 
catttgacga agcaattgat gacattggct gatggacgtg tggtgttggc tctagaagga 2880 
ggacatgatc tcacagccat ctgtgatgca tcagaagcct gtgtaaatgc ccttctagga 2940 
aatgagctgg agccacttgc agaagatatt ctccaccaaa gcccgaatat gaatgctgtt 3000 
atttctttac agaagatcat tgaaattcaa agtatgtctt taaagttctc ttaa 3054 



<210> 6 
<211> 967 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Met His Ser Met Ile Ser Ser Val Asp Val Lys Ser Glu Val Pro Val 
1 5 10 15 

Gly Leu Glu Pro Ile Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met 
20 25 30 

Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gln Leu Gln Gln 
35 40 45 

Glu Leu Leu Leu Ile Gln Gln Gln Gln Gln Ile Gln Lys Gln Leu Leu 
50 55 60 

Ile Ala Glu Phe Gln Lys Gln His Glu Asn Leu Thr Arg Gln His Gln 
65 70 75 80 

Ala Gln Leu Gln Glu His Ile Lys Glu Leu Leu Ala Ile Lys Gln Gln 
85 90 95 

Gln Glu Leu Leu Glu Lys Glu Gln Lys Leu Glu Gln Gln Arg Gln Glu 
100 105 110 

Gln Glu Val Glu Arg His Arg Arg Glu Gln Gln Leu Pro Pro Leu Arg 
115 120 125 

Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lys 
130 135 140 

Gln Lys Leu Gln Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr 
145 150 155 160 

Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp 
165 170 175 

Tyr Thr Ala Ala His His Thr Ser Leu Asp Gln Ser Ser Pro Pro Leu 
180 185 190 

Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly Ala Gln Asp 
195 200 205 

Ala Lys Asp Asp Phe Pro Leu Arg Lys Thr Glu Ser Ser Val Ser Ser 
210 215 220 

Ser Ser Pro Gly Ser Gly Pro Ser Ser Pro Asn Asn Gly Pro Thr Gly 
225 230 235 240 

Ser Val Thr Glu Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala 
245 250 255 

Glu Gln Met Val Ser Gln Gln Arg Ile Leu Ile His Glu Asp Ser Met 
260 265 270 

Asn Leu Leu Ser Leu Tyr Thr Ser Pro Ser Leu Pro Asn Ile Thr Leu 
275 280 285 

Gly Leu Pro Ala Val Pro Ser Gln Leu Asn Ala Ser Asn Ser Leu Lys 
290 295 300 

Glu Lys Gln Lys Cys Glu Thr Gln Thr Leu Arg Gln Gly Val Pro Leu 
305 310 315 320 

Pro Gly Gln Tyr Gly Gly Ser Ile Pro Ala Ser Ser Ser His Pro His 
325 330 335 

Val Thr Leu Glu Gly Lys Pro Pro Asn Ser Ser His Gln Ala Leu Leu 
340 345 350 

Gln His Leu Leu Leu Lys Glu Gln Met Arg Gln Gln Lys Leu Leu Val 
355 360 365 

Ala Gly Gly Val Pro Leu His Pro Gln Ser Pro Leu Ala Thr Lys Glu 
370 375 380 

Arg Ile Ser Pro Gly Ile Arg Gly Thr His Lys Leu Pro Arg His Arg 
385 390 395 400 

Pro Leu Asn Arg Thr Gln Ser Ala Pro Leu Pro Gln Ser Thr Leu Ala 
405 410 415 

Gln Leu Val Ile Gln Gln Gln His Gln Gln Phe Leu Glu Lys Gln Lys 
420 425 430 

Gln Tyr Gln Gln Gln Ile His Met Asn Lys Leu Leu Ser Lys Ser Ile 
435 440 445 

Glu Gln Leu Lys Gln Pro Gly Ser His Leu Glu Glu Ala Glu Glu Glu 
450 455 460 

Leu Gln Gly Asp Gln Ala Met Gln Glu Asp Arg Ala Pro Ser Ser Gly 
465 470 475 480 

Asn Ser Thr Arg Ser Asp Ser Ser Ala Cys Val Asp Asp Thr Leu Gly 
485 490 495 

Gln Val Gly Ala Val Lys Val Lys Glu Glu Pro Val Asp Ser Asp Glu 
500 505 510 

Asp Ala Gln Ile Gln Glu Met Glu Ser Gly Glu Gln Ala Ala Phe Met 
515 520 525 

Gln Gln Pro Phe Leu Glu Pro Thr His Thr Arg Ala Leu Ser Val Arg 
530 535 540 

Gln Ala Pro Leu Ala Ala Val Gly Met Asp Gly Leu Glu Lys His Arg 
545 550 555 560 

Leu Val Ser Arg Thr His Ser Ser Pro Ala Ala Ser Val Leu Pro His 
565 570 575 

Pro Ala Met Asp Arg Pro Leu Gln Pro Gly Ser Ala Thr Gly Ile Ala 
580 585 590 

Tyr Asp Pro Leu Met Leu Lys His Gln Cys Val Cys Gly Asn Ser Thr 
595 600 605 

Thr His Pro Glu His Ala Gly Arg Ile Gln Ser Ile Trp Ser Arg Leu 
610 615 620 

Gln Glu Thr Gly Leu Leu Asn Lys Cys Glu Arg Ile Gln Gly Arg Lys 
625 630 635 640 

Ala Ser Leu Glu Glu Ile Gln Leu Val His Ser Glu His His Ser Leu 
645 650 655 

Leu Tyr Gly Thr Asn Pro Leu Asp Gly Gln Lys Leu Asp Pro Arg Ile 
660 665 670 

Leu Leu Gly Asp Asp Ser Gln Lys Phe Phe Ser Ser Leu Pro Cys Gly 
675 680 685 

Gly Leu Gly Val Asp Ser Asp Thr Ile Trp Asn Glu Leu His Ser Ser 
690 695 700 

Gly Ala Ala Arg Met Ala Val Gly Cys Val Ile Glu Leu Ala Ser Lys 
705 710 715 720 

Val Ala Ser Gly Glu Leu Lys Asn Gly Phe Ala Val Val Arg Pro Pro 
725 730 735 

Gly His His Ala Glu Glu Ser Thr Ala Met Gly Phe Cys Phe Phe Asn 
740 745 750 

Ser Val Ala Ile Thr Ala Lys Tyr Leu Arg Asp Gln Leu Asn Ile Ser 
755 760 765 

Lys Ile Leu Ile Val Asp Leu Asp Val His His Gly Asn Gly Thr Gln 
770 775 780 

Gln Ala Phe Tyr Ala Asp Pro Ser Ile Leu Tyr Ile Ser Leu His Arg 
785 790 795 800 

Tyr Asp Glu Gly Asn Phe Phe Pro Gly Ser Gly Ala Pro Asn Glu Val 
805 810 815 

Gly Thr Gly Leu Gly Glu Gly Tyr Asn Ile Asn Ile Ala Trp Thr Gly 
820 825 830 

Gly Leu Asp Pro Pro Met Gly Asp Val Glu Tyr Leu Glu Ala Phe Arg 
835 840 845 

Thr Ile Val Lys Pro Val Ala Lys Glu Phe Asp Pro Asp Met Val Leu 
850 855 860 

Val Ser Ala Gly Phe Asp Ala Leu Glu Gly His Thr Pro Pro Leu Gly 
865 870 875 880 

Gly Tyr Lys Val Thr Ala Lys Cys Phe Gly His Leu Thr Lys Gln Leu 
885 890 895 

Met Thr Leu Ala Asp Gly Arg Val Val Leu Ala Leu Glu Gly Gly His 
900 905 910 

Asp Leu Thr Ala Ile Cys Asp Ala Ser Glu Ala Cys Val Asn Ala Leu 
915 920 925 

Leu Gly Asn Glu Leu Glu Pro Leu Ala Glu Asp Ile Leu His Gln Ser 
930 935 940 

Pro Asn Met Asn Ala Val Ile Ser Leu Gln Lys Ile Ile Glu Ile Gln 
945 950 955 960 

Ser Met Ser Leu Lys Phe Ser 
965 



<210> 7 
<211> 3367 
<212> DNA 

<213> Homo sapiens 
<400> 7 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780
gatgatttcc cccttcgaaa aactgaatcc tcagtcagta gcagttctcc aggctctggt 840
cccagttcac caaacaatgg gccaactgga agtgttactg aaaatgagac ttcggttttg 900
ccccctaccc ctcatgccga gcaaatggtt tcacagcaac gcattctaat tcatgaagat 960
tccatgaacc tgctaagtct ttatacctct ccttctttgc ccaacattac cttggggctt 1020
cccgcagtgc catcccagct caatgcttcg aattcactca aagaaaagca gaagtgtgag 1080
acgcagacgc ttaggcaagg tgttcctctg cctgggcagt atggaggcag catcccggca 1140
tcttccagcc accctcatgt tactttagag ggaaagccac ccaacagcag ccaccaggct 1200 
ctcctgcagc atttattatt gaaagaacaa atgcgacagc aaaagcttct tgtagctggt 1260 
ggagttccct tacatcctca gtctcccttg gcaacaaaag agagaatttc acctggcatt 1320 
agaggtaccc acaaattgcc ccgtcacaga cccctgaacc gaacccagtc tgcacctttg 1380 
cctcagagca cgttggctca gctggtcatt caacagcaac accagcaatt cttggagaag 1440 
cagaagcaat accagcagca gatccacatg aacaaactgc tttcgaaatc tattgaacaa 1500 
ctgaagcaac caggcagtca ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg 1560 
atgcaggaag acagagcgcc ctctagtggc aacagcacta ggagcgacag cagtgcttgt 1620 
gtggatgaca cactgggaca agttggggct gtgaaggtca aggaggaacc agtggacagt 1680
gatgaagatg ctcagatcca ggaaatggaa tctggggagc aggctgcttt tatgcaacag 1740
cctttcctgg aacccacgca cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg 1800
gttggcatgg atggattaga gaaacaccgt ctcgtctcca ggactcactc ttcccctgct 1860
gcctctgttt tacctcaccc agcaatggac cgccccctcc agcctggctc tgcaactgga 1920
attgcctatg accccttgat gctgaaacac cagtgcgttt gtggcaattc caccacccac 1980
cctgagcatg ctggacgaat acagagtatc tggtcacgac tgcaagaaac tgggctgcta 2040
aataaatgtg agcgaattca aggtcgaaaa gccagcctgg aggaaataca gcttgttcat 2100
tctgaacatc actcactgtt gtatggcacc aaccccctgg acggacagaa gctggacccc 2160
aggatactcc taggtgatga ctctcaaaag tttttttcct cattaccttg tggtggactt 2220
ggggtggaca gtgacaccat ttggaatgag ctacactcgt ccggtgctgc acgcatggct 2280
gttggctgtg tcatcgagct ggcttccaaa gtggcctcag gagagctgaa gaatgggttt 2340
gctgttgtga ggccccctgg ccatcacgct gaagaatcca cagccatggg gttctgcttt 2400
tttaattcag ttgcaattac cgccaaatac ttgagagacc aactaaatat aagcaagata 2460
ttgattgtag atctggatgt tcaccatgga aacggtaccc agcaggcctt ttatgctgac 2520
cccagcatcc tgtacatttc actccatcgc tatgatgaag ggaacttttt ccctggcagt 2580
ggagccccaa atgaggttcg gtttatttct ttagagcccc acttttattt gtatctttca 2640
ggtaattgca ttgcatgatt acccctaatt ttcttgtcct ttgctggtgt tttaaattac 2700
acgagattac tgaattgtcc catgggacca agaaccagtg cagaacaagt gcataaccca 2760
gagcactgtt tgtcagggaa ggttgggctg atttgatgtg ttgtttgatg tttatttcaa 2820
gagctcccat gtgcttgttt tcctctcttc ttgctttctt ccatttgctc tcttctctgc 2880
ccaccgtggt gtgtctttct cttcccaggt tggaacaggc cttggagaag ggtacaatat 2940
aaatattgcc tggacaggtg gccttgatcc tcccatggga gatgttgagt accttgaagc 3000
attcaggacc atcgtgaagc ctgtggccaa agagtttgat ccagacatgg tcttagtatc 3060
tgctggattt gatgcattgg aaggccacac ccctcctcta ggagggtaca aagtgacggc 3120
aaaatgtttt ggtcatttga cgaagcaatt gatgacattg gctgatggac gtgtggtgtt 3180
ggctctagaa ggaggacatg atctcacagc catctgtgat gcatcagaag cctgtgtaaa 3240
tgcccttcta ggaaatgagc tggagccact tgcagaagat attctccacc aaagcccgaa 3300
tatgaatgct gttatttctt tacagaagat cattgaaatt caaagtatgt ctttaaagtt 3360
ctcttaa 3367



<210> 8 
<211> 835 
<212> PRT 
<213> Homo sapiens 

<400> 8 

Met His Ser Met Ile Ser Ser Val Asp Val Lys Ser Glu Val Pro Val
1 5 10 15

Gly Leu Glu Pro Ile Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met
20 25 30

Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gln Leu Gln Gln
35 40 45

Glu Leu Leu Leu Ile Gln Gln Gln Gln Gln Ile Gln Lys Gln Leu Leu
50 55 60

Ile Ala Glu Phe Gln Lys Gln His Glu Asn Leu Thr Arg Gln His Gln
65 70 75 80

Ala Gln Leu Gln Glu His Ile Lys Glu Leu Leu Ala Ile Lys Gln Gln
85 90 95

Gln Glu Leu Leu Glu Lys Glu Gln Lys Leu Glu Gln Gln Arg Gln Glu
100 105 110

Gln Glu Val Glu Arg His Arg Arg Glu Gln Gln Leu Pro Pro Leu Arg
115 120 125

Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lys
130 135 140

Gln Lys Leu Gln Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr
145 150 155 160

Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp
165 170 175

Tyr Thr Ala Ala His His Thr Ser Leu Asp Gln Ser Ser Pro Pro Leu
180 185 190

Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly Ala Gln Asp
195 200 205

Ala Lys Asp Asp Phe Pro Leu Arg Lys Thr Glu Ser Ser Val Ser Ser
210 215 220

Ser Ser Pro Gly Ser Gly Pro Ser Ser Pro Asn Asn Gly Pro Thr Gly
225 230 235 240

Ser Val Thr Glu Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala
245 250 255

Glu Gln Met Val Ser Gln Gln Arg Ile Leu Ile His Glu Asp Ser Met
260 265 270

Asn Leu Leu Ser Leu Tyr Thr Ser Pro Ser Leu Pro Asn Ile Thr Leu
275 280 285

Gly Leu Pro Ala Val Pro Ser Gln Leu Asn Ala Ser Asn Ser Leu Lys
290 295 300

Glu Lys Gln Lys Cys Glu Thr Gln Thr Leu Arg Gln Gly Val Pro Leu
305 310 315 320

Pro Gly Gln Tyr Gly Gly Ser Ile Pro Ala Ser Ser Ser His Pro His
325 330 335

Val Thr Leu Glu Gly Lys Pro Pro Asn Ser Ser His Gln Ala Leu Leu
340 345 350

Gln His Leu Leu Leu Lys Glu Gln Met Arg Gln Gln Lys Leu Leu Val
355 360 365

Ala Gly Gly Val Pro Leu His Pro Gln Ser Pro Leu Ala Thr Lys Glu
370 375 380

Arg Ile Ser Pro Gly Ile Arg Gly Thr His Lys Leu Pro Arg His Arg
385 390 395 400

Pro Leu Asn Arg Thr Gln Ser Ala Pro Leu Pro Gln Ser Thr Leu Ala
405 410 415

Gln Leu Val Ile Gln Gln Gln His Gln Gln Phe Leu Glu Lys Gln Lys
420 425 430

Gln Tyr Gln Gln Gln Ile His Met Asn Lys Leu Leu Ser Lys Ser Ile
435 440 445

Glu Gln Leu Lys Gln Pro Gly Ser His Leu Glu Glu Ala Glu Glu Glu
450 455 460

Leu Gln Gly Asp Gln Ala Met Gln Glu Asp Arg Ala Pro Ser Ser Gly
465 470 475 480

Asn Ser Thr Arg Ser Asp Ser Ser Ala Cys Val Asp Asp Thr Leu Gly
485 490 495

Gln Val Gly Ala Val Lys Val Lys Glu Glu Pro Val Asp Ser Asp Glu
500 505 510

Asp Ala Gln Ile Gln Glu Met Glu Ser Gly Glu Gln Ala Ala Phe Met
515 520 525

Gln Gln Pro Phe Leu Glu Pro Thr His Thr Arg Ala Leu Ser Val Arg
530 535 540

Gln Ala Pro Leu Ala Ala Val Gly Met Asp Gly Leu Glu Lys His Arg
545 550 555 560

Leu Val Ser Arg Thr His Ser Ser Pro Ala Ala Ser Val Leu Pro His
565 570 575

Pro Ala Met Asp Arg Pro Leu Gln Pro Gly Ser Ala Thr Gly Ile Ala
580 585 590

Tyr Asp Pro Leu Met Leu Lys His Gln Cys Val Cys Gly Asn Ser Thr
595 600 605

Thr His Pro Glu His Ala Gly Arg Ile Gln Ser Ile Trp Ser Arg Leu
610 615 620

Gln Glu Thr Gly Leu Leu Asn Lys Cys Glu Arg Ile Gln Gly Arg Lys
625 630 635 640

Ala Ser Leu Glu Glu Ile Gln Leu Val His Ser Glu His His Ser Leu
645 650 655

Leu Tyr Gly Thr Asn Pro Leu Asp Gly Gln Lys Leu Asp Pro Arg Ile
660 665 670

Leu Leu Gly Asp Asp Ser Gln Lys Phe Phe Ser Ser Leu Pro Cys Gly
675 680 685

Gly Leu Gly Val Asp Ser Asp Thr Ile Trp Asn Glu Leu His Ser Ser
690 695 700

Gly Ala Ala Arg Met Ala Val Gly Cys Val Ile Glu Leu Ala Ser Lys
705 710 715 720

Val Ala Ser Gly Glu Leu Lys Asn Gly Phe Ala Val Val Arg Pro Pro
725 730 735

Gly His His Ala Glu Glu Ser Thr Ala Met Gly Phe Cys Phe Phe Asn
740 745 750

Ser Val Ala Ile Thr Ala Lys Tyr Leu Arg Asp Gln Leu Asn Ile Ser
755 760 765

Lys Ile Leu Ile Val Asp Leu Asp Val His His Gly Asn Gly Thr Gln
770 775 780

Gln Ala Phe Tyr Ala Asp Pro Ser Ile Leu Tyr Ile Ser Leu His Arg
785 790 795 800

Tyr Asp Glu Gly Asn Phe Phe Pro Gly Ser Gly Ala Pro Asn Glu Val
805 810 815

Arg Phe Ile Ser Leu Glu Pro His Phe Tyr Leu Tyr Leu Ser Gly Asn
820 825 830

Cys Ile Ala
835

<210> 9 

<211> 1791 

<212> DNA 

<213> Homo sapiens 

<400> 9 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60 
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180 
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240 
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300 
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360 
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420 
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480 
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540 
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600 
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660 
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720 
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780 
gatgatttcc cccttcgaaa aactgaatcc tcagtcagta gcagttctcc aggctctggt 840 
cccagttcac caaacaatgg gccaactgga agtgttactg aaaatgagac ttcggttttg 900 
ccccctaccc ctcatgccga gcaaatggtt tcacagcaac gcattctaat tcatgaagat 960 
tccatgaacc tgctaagtct ttatacctct ccttctttgc ccaacattac cttggggctt 1020 
cccgcagtgc catcccagct caatgcttcg aattcactca aagaaaagca gaagtgtgag 1080 
acgcagacgc ttaggcaagg tgttcctctg cctgggcagt atggaggcag catcccggca 1140 
tcttccagcc accctcatgt tactttagag ggaaagccac ccaacagcag ccaccaggct 1200 
ctcctgcagc atttattatt gaaagaacaa atgcgacagc aaaagcttct tgtagctggt 1260 
ggagttccct tacatcctca gtctcccttg gcaacaaaag agagaatttc acctggcatt 1320 
agaggtaccc acaaattgcc ccgtcacaga cccctgaacc gaacccagtc tgcacctttg 1380 
cctcagagca cgttggctca gctggtcatt caacagcaac accagcaatt cttggagaag 1440 
cagaagcaat accagcagca gatccacatg aacaaactgc tttcgaaatc tattgaacaa 1500 
ctgaagcaac caggcagtca ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg 1560 
atgcaggaag acagagcgcc ctctagtggc aacagcacta ggagcgacag cagtgcttgt 1620

gtggatgaca cactgggaca agttggggct gtgaaggtca aggaggaacc agtggacagt 1680 

gatgaagatg ctcagatcca ggaaatggaa tctggggagc aggctgcttt tatgcaacag 1740 

gtaataggca aagatttagc tccaggattt gtaattaaag tcattatctg a 1791

<210> 10 

<211> 546 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met His Ser Met Ile Ser Ser Val Asp Val Lys Ser Glu Val Pro Val
1 5 10 15

Gly Leu Glu Pro Ile Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met
20 25 30

Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gln Leu Gln Gln
35 40 45

Glu Leu Leu Leu Ile Gln Gln Gln Gln Gln Ile Gln Lys Gln Leu Leu
50 55 60

Ile Ala Glu Phe Gln Lys Gln His Glu Asn Leu Thr Arg Gln His Gln
65 70 75 80

Ala Gln Leu Gln Glu His Ile Lys Glu Leu Leu Ala Ile Lys Gln Gln
85 90 95

Gln Glu Leu Leu Glu Lys Glu Gln Lys Leu Glu Gln Gln Arg Gln Glu
100 105 110

Gln Glu Val Glu Arg His Arg Arg Glu Gln Gln Leu Pro Pro Leu Arg
115 120 125

Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lys
130 135 140

Gln Lys Leu Gln Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr
145 150 155 160

Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp
165 170 175

Tyr Thr Ala Ala His His Thr Ser Leu Asp Gln Ser Ser Pro Pro Leu
180 185 190

Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly Ala Gln Asp
195 200 205

Ala Lys Asp Asp Phe Pro Leu Arg Lys Thr Glu Ser Ser Val Ser Ser
210 215 220

Ser Ser Pro Gly Ser Gly Pro Ser Ser Pro Asn Asn Gly Pro Thr Gly
225 230 235 240

Ser Val Thr Glu Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala
245 250 255

Glu Gln Met Val Ser Gln Gln Arg Ile Leu Ile His Glu Asp Ser Met
260 265 270

Asn Leu Leu Ser Leu Tyr Thr Ser Pro Ser Leu Pro Asn Ile Thr Leu
275 280 285

Gly Leu Pro Ala Val Pro Ser Gln Leu Asn Ala Ser Asn Ser Leu Lys
290 295 300

Glu Lys Gln Lys Cys Glu Thr Gln Thr Leu Arg Gln Gly Val Pro Leu
305 310 315 320

Pro Gly Gln Tyr Gly Gly Ser Ile Pro Ala Ser Ser Ser His Pro His
325 330 335

Val Thr Leu Glu Gly Lys Pro Pro Asn Ser Ser His Gln Ala Leu Leu
340 345 350

Gln His Leu Leu Leu Lys Glu Gln Met Arg Gln Gln Lys Leu Leu Val
355 360 365

Ala Gly Gly Val Pro Leu His Pro Gln Ser Pro Leu Ala Thr Lys Glu
370 375 380

Arg Ile Ser Pro Gly Ile Arg Gly Thr His Lys Leu Pro Arg His Arg
385 390 395 400

Pro Leu Asn Arg Thr Gln Ser Ala Pro Leu Pro Gln Ser Thr Leu Ala
405 410 415

Gln Leu Val Ile Gln Gln Gln His Gln Gln Phe Leu Glu Lys Gln Lys
420 425 430

Gln Tyr Gln Gln Gln Ile His Met Asn Lys Leu Leu Ser Lys Ser Ile
435 440 445

Glu Gln Leu Lys Gln Pro Gly Ser His Leu Glu Glu Ala Glu Glu Glu
450 455 460

Leu Gln Gly Asp Gln Ala Met Gln Glu Asp Arg Ala Pro Ser Ser Gly
465 470 475 480

Asn Ser Thr Arg Ser Asp Ser Ser Ala Cys Val Asp Asp Thr Leu Gly
485 490 495

Gln Val Gly Ala Val Lys Val Lys Glu Glu Pro Val Asp Ser Asp Glu
500 505 510

Asp Ala Gln Ile Gln Glu Met Glu Ser Gly Glu Gln Ala Ala Phe Met
515 520 525

Gln Gln Val Ile Gly Lys Asp Leu Ala Pro Gly Phe Val Ile Lys Val
530 535 540

Ile Ile
545



<210> 11 
<211> 590 
<212> PRT 

<213> Homo sapiens 
<400> 11 

Met His Ser Met Ile Ser Ser Val Asp Val Lys Ser Glu Val Pro Val
1 5 10 15

Gly Leu Glu Pro Ile Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met
20 25 30

Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gln Leu Gln Gln
35 40 45

Glu Leu Leu Leu Ile Gln Gln Gln Gln Gln Ile Gln Lys Gln Leu Leu
50 55 60

Ile Ala Glu Phe Gln Lys Gln His Glu Asn Leu Thr Arg Gln His Gln
65 70 75 80

Ala Gln Leu Gln Glu His Ile Lys Glu Leu Leu Ala Ile Lys Gln Gln
85 90 95

Gln Glu Leu Leu Glu Lys Glu Gln Lys Leu Glu Gln Gln Arg Gln Glu
100 105 110

Gln Glu Val Glu Arg His Arg Arg Glu Gln Gln Leu Pro Pro Leu Arg
115 120 125

Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lys
130 135 140

Gln Lys Leu Gln Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr
145 150 155 160

Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp
165 170 175

Tyr Thr Ala Ala His His Thr Ser Leu Asp Gln Ser Ser Pro Pro Leu
180 185 190

Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly Ala Gln Asp
195 200 205

Ala Lys Asp Asp Phe Pro Leu Arg Lys Thr Ala Ser Glu Pro Asn Leu
210 215 220

Lys Val Arg Ser Arg Leu Lys Gln Lys Val Ala Glu Arg Arg Ser Ser
225 230 235 240

Pro Leu Leu Arg Arg Lys Asp Gly Asn Val Val Thr Ser Phe Lys Lys
245 250 255

Arg Met Phe Glu Val Thr Glu Ser Ser Val Ser Ser Ser Ser Pro Gly
260 265 270

Ser Gly Pro Ser Ser Pro Asn Asn Gly Pro Thr Gly Ser Val Thr Glu
275 280 285

Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala Glu Gln Met Val
290 295 300

Ser Gln Gln Arg Ile Leu Ile His Glu Asp Ser Met Asn Leu Leu Ser
305 310 315 320

Leu Tyr Thr Ser Pro Ser Leu Pro Asn Ile Thr Leu Gly Leu Pro Ala
325 330 335

Val Pro Ser Gln Leu Asn Ala Ser Asn Ser Leu Lys Glu Lys Gln Lys
340 345 350

Cys Glu Thr Gln Thr Leu Arg Gln Gly Val Pro Leu Pro Gly Gln Tyr
355 360 365

Gly Gly Ser Ile Pro Ala Ser Ser Ser His Pro His Val Thr Leu Glu
370 375 380

Gly Lys Pro Pro Asn Ser Ser His Gln Ala Leu Leu Gln His Leu Leu
385 390 395 400

Leu Lys Glu Gln Met Arg Gln Gln Lys Leu Leu Val Ala Gly Gly Val
405 410 415

Pro Leu His Pro Gln Ser Pro Leu Ala Thr Lys Glu Arg Ile Ser Pro
420 425 430

Gly Ile Arg Gly Thr His Lys Leu Pro Arg His Arg Pro Leu Asn Arg
435 440 445

Thr Gln Ser Ala Pro Leu Pro Gln Ser Thr Leu Ala Gln Leu Val Ile
450 455 460

Gln Gln Gln His Gln Gln Phe Leu Glu Lys Gln Lys Gln Tyr Gln Gln
465 470 475 480

Gln Ile His Met Asn Lys Leu Leu Ser Lys Ser Ile Glu Gln Leu Lys
485 490 495

Gln Pro Gly Ser His Leu Glu Glu Ala Glu Glu Glu Leu Gln Gly Asp
500 505 510

Gln Ala Met Gln Glu Asp Arg Ala Pro Ser Ser Gly Asn Ser Thr Arg
515 520 525

Ser Asp Ser Ser Ala Cys Val Asp Asp Thr Leu Gly Gln Val Gly Ala
530 535 540

Val Lys Val Lys Glu Glu Pro Val Asp Ser Asp Glu Asp Ala Gln Ile
545 550 555 560

Gln Glu Met Glu Ser Gly Glu Gln Ala Ala Phe Met Gln Gln Val Ile
565 570 575

Gly Lys Asp Leu Ala Pro Gly Phe Val Ile Lys Val Ile Ile
580 585 590





<210> 12 

<211> 1084 

<212> PRT 

<213> Homo sapiens 



<400> 12 



Met Ser Ser Gln Ser His Pro Asp Gly Leu Ser Gly Arg Asp Gln Pro
1 5 10 15

Val Glu Leu Leu Asn Pro Ala Arg Val Asn His Met Pro Ser Thr Val
20 25 30

Asp Val Ala Thr Ala Leu Pro Leu Gln Val Ala Pro Ser Ala Val Pro
35 40 45

Met Asp Leu Arg Leu Asp His Gln Phe Ser Leu Pro Val Ala Glu Pro
50 55 60

Ala Leu Arg Glu Gln Gln Leu Gln Gln Glu Leu Leu Ala Leu Lys Gln
65 70 75 80

Lys Gln Gln Ile Gln Arg Gln Ile Leu Ile Ala Glu Phe Gln Arg Gln
85 90 95

His Glu Gln Leu Ser Arg Gln His Glu Ala Gln Leu His Glu His Ile
100 105 110

Lys Gln Gln Gln Glu Met Leu Ala Met Lys His Gln Gln Glu Leu Leu
115 120 125

Glu His Gln Arg Lys Leu Glu Arg His Arg Gln Glu Gln Glu Leu Glu
130 135 140

Lys Gln His Arg Glu Gln Lys Leu Gln Gln Leu Lys Asn Lys Glu Lys
145 150 155 160

Gly Lys Glu Ser Ala Val Ala Ser Thr Glu Val Lys Met Lys Leu Gln
165 170 175

Glu Phe Val Leu Asn Lys Lys Lys Ala Leu Ala His Arg Asn Leu Asn
180 185 190

His Cys Ile Ser Ser Asp Pro Arg Tyr Trp Tyr Gly Lys Thr Gln His
195 200 205

??« Leu Asp Gln Ser Ser Pro Pro Gln Gly Val Ser Thr Ser
210 215 220

Tyr Asn His Pro Val Leu Gly Met Tyr Asp Ala Lys Asp Asp Phe Pro
225 230 235 240

Leu Arg Lys Thr Ala Ser Glu Pro Asn Leu Lys Leu Arg Ser Arg Leu
245 250 255

Lys Gln Lys Val Ala Glu Arg Arg Ser Ser Pro Leu Leu Arg Arg Lys
260 265 270

Asp Gly Pro Val Val Thr Ala Leu Lys Lys Arg Pro Leu Asp Val Thr
275 280 285

Asp Ser Ala Cys Ser Ser Ala Pro Gly Ser Gly Pro Ser Ser Pro Asn
290 295 300

Asn Ser Ser Gly Ser Val Ser Ala Glu Asn Gly Ile Ala Pro Ala Val
305 310 315 320

Pro Ser Ile Pro Ala Glu Thr Ser Leu Ala His Arg Leu Val Ala Arg
325 330 335

Glu Gly Ser Ala Ala Pro Leu Pro Leu Tyr Thr Ser Pro Ser Leu Pro
340 345 350

Asn Ile Thr Leu Gly Leu Pro Ala Thr Gly Pro Ser Ala Gly Thr Ala
355 360 365

Gly Gln Gln Asp Thr Glu Arg Leu Thr Leu Pro Ala Leu Gln Gln Arg
370 375 380

Leu Ser Leu Phe Pro Gly Thr His Leu Thr Pro Tyr Leu Ser Thr Ser
385 390 395 400

Pro Leu Glu Arg Asp Gly Gly Ala Ala His Ser Pro Leu Leu Gln His
405 410 415

Met Val Leu Leu Glu Gln Pro Pro Ala Gln Ala Pro Leu Val Thr Gly
420 425 430

Leu Gly Ala Leu Pro Leu His Ala Gln Ser Leu Val Gly Ala Asp Arg
435 440 445

Val Ser Pro Ser Ile His Lys Leu Arg Gln His Arg Pro Leu Gly Arg
450 455 460

Thr Gln Ser Ala Pro Leu Pro Gln Asn Ala Gln Ala Leu Gln His Leu
465 470 475 480

Val Ile Gln Gln Gln His Gln Gln Phe Leu Glu Lys His Lys Gln Gln
485 490 495

Phe Gln Gln Gln Gln Leu Gln Met Asn Lys Ile Ile Pro Lys Pro Ser
500 505 510

Glu Pro Ala Arg Gln Pro Glu Ser His Pro Glu Glu Thr Glu Glu Glu
515 520 525

Leu Arg Glu His Gln Ala Leu Leu Asp Glu Pro Tyr Leu Asp Arg Leu
530 535 540

Pro Gly Gln Lys Glu Ala His Ala Gln Ala Gly Val Gln Val Lys Gln
545 550 555 560

Glu Pro Ile Glu Ser Asp Glu Glu Glu Ala Glu Pro Pro Arg Glu Val
565 570 575

Glu Pro Gly Gln Arg Gln Pro Ser Glu Gln Glu Leu Leu Phe Arg Gln
580 585 590

Gln Ala Leu Leu Leu Glu Gln Gln Arg Ile His Gln Leu Arg Asn Tyr
595 600 605

Gln Ala Ser Met Glu Ala Ala Gly Ile Pro Val Ser Phe Gly Gly His
610 615 620

Arg Pro Leu Ser Arg Ala Gln Ser Ser Pro Ala Ser Ala Thr Phe Pro
625 630 635 640

Val Ser Val Gln Glu Pro Pro Thr Lys Pro Arg Phe Thr Thr Gly Leu
645 650 655

Val Tyr Asp Thr Leu Met Leu Lys His Gln Cys Thr Cys Gly Ser Ser
660 665 670

Ser Ser His Pro Glu His Ala Gly Arg Ile Gln Ser Ile Trp Ser Arg
675 680 685

Leu Gln Glu Thr Gly Leu Arg Gly Lys Cys Glu Cys Ile Arg Gly Arg
690 695 700

Lys Ala Thr Leu Glu Glu Leu Gln Thr Val His Ser Glu Ala His Thr
705 710 715 720

Leu Leu Tyr Gly Thr Asn Pro Leu Asn Arg Gln Lys Leu Asp Ser Lys
725 730 735

Lys Leu Leu Gly Ser Leu Ala Ser Val Phe Val Arg Leu Pro Cys Gly
740 745 750

Gly Val Gly Val Asp Ser Asp Thr Ile Trp Asn Glu Val His Ser Ala
755 760 765

Gly Ala Ala Arg Leu Ala Val Gly Cys Val Val Glu Leu Val Phe Lys
770 775 780

Val Ala Thr Gly Glu Leu Lys Asn Gly Phe Ala Val Val Arg Pro Pro
785 790 795 800

Gly His His Ala Glu Glu Ser Thr Pro Met Gly Phe Cys Tyr Phe Asn
805 810 815

Ser Val Ala Val Ala Ala Lys Leu Leu Gln Gln Arg Leu Ser Val Ser
820 825 830

Lys Ile Leu Ile Val Asp Trp Asp Val His His Gly Asn Gly Thr Gln
835 840 845

Gln Ala Phe Tyr Ser Asp Pro Ser Val Leu Tyr Met Ser Leu His Arg
850 855 860

Tyr Asp Asp Gly Asn Phe Phe Pro Gly Ser Gly Ala Pro Asp Glu Val
865 870 875 880

Gly Thr Gly Pro Gly Val Gly Phe Asn Val Asn Met Ala Phe Thr Gly
885 890 895

Gly Leu Asp Pro Pro Met Gly Asp Ala Glu Tyr Leu Ala Ala Phe Arg
900 905 910

Thr Val Val Met Pro Ile Ala Ser Glu Phe Ala Pro Asp Val Val Leu
915 920 925

Val Ser Ser Gly Phe Asp Ala Val Glu Gly His Pro Thr Pro Leu Gly
930 935 940

Gly Tyr Asn Leu Ser Ala Arg Cys Phe Gly Tyr Leu Thr Lys Gln Leu
945 950 955 960

Met Gly Leu Ala Gly Gly Arg Ile Val Leu Ala Leu Glu Gly Gly His
965 970 975

Asp Leu Thr Ala Ile Cys Asp Ala Ser Glu Ala Cys Val Ser Ala Leu
980 985 990

Leu Gly Asn Glu Leu Asp Pro Leu Pro Glu Lys Val Leu Gln Gln Arg
995 1000 1005

Pro Asn Ala Asn Ala Val Arg Ser Met Glu Lys Val Met Glu Ile His
1010 1015 1020

Ser Lys Tyr Trp Arg Cys Leu Gln Arg Thr Thr Ser Thr Ala Gly Arg
1025 1030 1035 1040

Ser Leu Ile Glu Ala Gln Thr Cys Glu Asn Glu Glu Ala Glu Thr Val
1045 1050 1055

Thr Ala Met Ala Ser Leu Ser Val Gly Val Lys Pro Ala Glu Lys Arg
1060 1065 1070

Pro Asp Glu Glu Pro Met Glu Glu Glu Pro Pro Leu
1075 1080



<210> 13 
<211> 3550

<212> DNA 

<213> Homo sapiens 

<400> 13 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60 
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120 
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180 
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240 
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300 
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420 
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480 
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540 
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600 
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660 
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720 
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780 
gatgatttcc cccttcgaaa aactgcctct gagcccaact tgaaggtgcg gtccaggtta 840 
aaacagaaag tggcagagag gagaagcagc cccttactca ggcggaagga tggaaatgtt 900 
gtcacttcat tcaagaagcg aatgtttgag gtgacagaat cctcagtcag tagcagttct 960 
ccaggctctg gtcccagttc accaaacaat gggccaactg gaagtgttac tgaaaatgag 1020 
acttcggttt tgccccctac ccctcatgcc gagcaaatgg tttcacagca acgcattcta 1080 
attcatgaag attccatgaa cctgctaagt ctttatacct ctccttcttt gcccaacatt 1140 
accttggggc ttcccgcagt gccatcccag ctcaatgctt cgaattcact caaagaaaag 1200 
cagaagtgtg agacgcagac gcttaggcaa ggtgttcctc tgcctgggca gtatggaggc 1260 
agcatcccgg catcttccag ccaccctcat gttactttag agggaaagcc acccaacagc 1320 
agccaccagg ctctcctgca gcatttatta ttgaaagaac aaatgcgaca gcaaaagctt 1380 
cttgtagctg gtggagttcc cttacatcct cagtctccct tggcaacaaa agagagaatt 1440 
tcacctggca ttagaggtac ccacaaattg ccccgtcaca gacccctgaa ccgaacccag 1500 
tctgcacctt tgcctcagag cacgttggct cagctggtca ttcaacagca acaccagcaa 1560 
ttcttggaga agcagaagca ataccagcag cagatccaca tgaacaaact gctttcgaaa 1620 
tctattgaac aactgaagca accaggcagt caccttgagg aagcagagga agagcttcag 1680 
ggggaccagg cgatgcagga agacagagcg ccctctagtg gcaacagcac taggagcgac 1740 
agcagtgctt gtgtggatga cacactggga caagttgggg ctgtgaaggt caaggaggaa 1800 
ccagtggaca gtgatgaaga tgctcagatc caggaaatgg aatctgggga gcaggctgct 1860 
tttatgcaac aggtaatagg caaagattta gctccaggat ttgtaattaa agtcattatc 1920 
tgacctttcc tggaacccac gcacacacgt gcgctctctg tgcgccaagc tccgctggct 1980 
gcggttggca tggatggatt agagaaacac cgtctcgtct ccaggactca ctcttcccct 2040 
gctgcctctg ttttacctca cccagcaatg gaccgccccc tccagcctgg ctctgcaact 2100 
ggaattgcct atgacccctt gatgctgaaa caccagtgcg tttgtggcaa ttccaccacc 2160 
caccctgagc atgctggacg aatacagagt atctggtcac gactgcaaga aactgggctg 2220 
ctaaataaat gtgagcgaat tcaaggtcga aaagccagcc tggaggaaat acagcttgtt 2280 
cattctgaac atcactcact gttgtatggc accaaccccc tggacggaca gaagctggac 2340 
cccaggatac tcctaggtga tgactctcaa aagttttttt cctcattacc ttgtggtgga 2400 
cttggggtgg acagtgacac catttggaat gagctacact cgtccggtgc tgcacgcatg 2460 
gctgttggct gtgtcatcga gctggcttcc aaagtggcct caggagagct gaagaatggg 2520 
tttgctgttg tgaggccccc tggccatcac gctgaagaat ccacagccat ggggttctgc 2580 
ttttttaatt cagttgcaat taccgccaaa tacttgagag accaactaaa tataagcaag 2640 
atattgattg tagatctgga tgttcaccat ggaaacggta cccagcaggc cttttatgct 2700 
gaccccagca tcctgtacat ttcactccat cgctatgatg aagggaactt tttccctggc 2760 
agtggagccc caaatgaggt tcggtttatt tctttagagc cccactttta tttgtatctt 2820 
tcaggtaatt gcattgcatg attaccccta attttcttgt cctttgctgg tgttttaaat 2880 
tacacgagat tactgaattg tcccatggga ccaagaacca gtgcagaaca agtgcataac 2940 
ccagagcact gtttgtcagg gaaggttggg ctgatttgat gtgttgtttg atgtttattt 3000 
caagagctcc catgtgcttg ttttcctctc ttcttgcttt cttccatttg ctctcttctc 3060 
tgcccaccgt ggtgtgtctt tctcttccca ggttggaaca ggccttggag aagggtacaa 3120 
tataaatatt gcctggacag gtggccttga tcctcccatg ggagatgttg agtaccttga 3180 
agcattcagg accatcgtga agcctgtggc caaagagttt gatccagaca tggtcttagt 3240 
atctgctgga tttgatgcat tggaaggcca cacccctcct ctaggagggt acaaagtgac 3300 
ggcaaaatgt tttggtcatt tgacgaagca attgatgaca ttggctgatg gacgtgtggt 3360
gttggctcta gaaggaggac atgatctcac agccatctgt gatgcatcag aagcctgtgt 3420 
aaatgccctt ctaggaaatg agctggagcc acttgcagaa gatattctcc accaaagccc 3480
gaatatgaat gctgttattt ctttacagaa gatcattgaa attcaaagta tgtctttaaa 3540 
gttctcttaa 3550 

<210> 14 

<211> 7699

<212> DNA 

<213> Homo sapiens 



<400> 14 

cccattcgcc attcaggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc 60
tattacgcca gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccca 120
gggttttccc agtcacgacg ttgtaaaacg acggccagtg ccaagctgat ctaatcaata 180
ttggccatta gccatattat tcattggtta tatagcataa atcaatattg gctattggcc 240
attgcatacg ttgtatccat atcataatat gtacatttat attggctcat gtccaacatt 300
accgccatgt tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt 360
agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg 420
cgaccgccca gcgacccccg cccgttgacg tcaatagtga cgtatgttcc catagtaacg 480
ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg 540
gcagtacatc aagtgtatca tatgccaagt ccgcccccta ttgacgtcaa tgacggtaaa 600
tggcccgcct agcattatgc ccagtacatg accttacggg agtttcctac ttggcagtac 660
atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta caccaatggg 720
cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg 780
agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaataa ccccgccccg 840
ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcgttta 900
gtgaaccgtc agaattcaag cttgcggccg cagatctatc gatctgcagg atatcaccat 960
gcacagtatg atcagctcag tggatgtgaa gtcagaagtt cctgtgggcc tggagcccat 1020
ctcaccttta gacctaagga cagacctcag gatgatgatg cccgtggtgg accctgttgt 1080
ccgtgagaag caattgcagc aggaattact tcttatccag cagcagcaac aaatccagaa 1140
gcagcttctg atagcagagt ttcagaaaca gcatgagaac ttgacacggc agcaccaggc 1200
tcagcttcag gagcatatca aggaacttct agccataaaa cagcaacaag aactcctaga 1260
aaaggagcag aaactggagc agcagaggca agaacaggaa gtagagaggc atcgcagaga 1320
acagcagctt cctcctctca gaggcaaaga tagaggacga gaaagggcag tggcaagtac 1380
agaagtaaag cagaagcttc aagagttcct actgagtaaa tcagcaacga aagacactcc 1440
aactaatgga aaaaatcatt ccgtgagccg ccatcccaag ctctggtaca cggctgccca 1500
ccacacatca ttggatcaaa gctctccacc ccttagtgga acatctccat cctacaagta 1560
cacattacca ggagcacaag atgcaaagga tgatttcccc cttcgaaaaa ctgcctctga 1620
gcccaacttg aaggtgcggt ccaggttaaa acagaaagtg gcagagagga gaagcagccc 1680
cttactcagg cggaaggatg gaaatgttgt cacttcattc aagaagcgaa tgtttgaggt 1740
gacagaatcc tcagtcagta gcagttctcc aggctctggt cccagttcac caaacaatgg 1800
gccaactgga agtgttactg aaaatgagac ttcggttttg ccccctaccc ctcatgccga 1860
gcaaatggtt tcacagcaac gcattctaat tcatgaagat tccatgaacc tgctaagtct 1920
ttatacctct ccttctttgc ccaacattac cttggggctt cccgcagtgc catcccagct 1980
caatgcttcg aattcactca aagaaaagca gaagtgtgag acgcagacgc ttaggcaagg 2040
tgttcctctg cctgggcagt atggaggcag catcccggca tcttccagcc accctcatgt 2100
tactttagag ggaaagccac ccaacagcag ccaccaggct ctcctgcagc atttattatt 2160
gaaagaacaa atgcgacagc aaaagcttct tgtagctggt ggagttccct tacatcctca 2220
gtctcccttg gcaacaaaag agagaatttc acctggcatt agaggtaccc acaaattgcc 2280
ccgtcacaga cccctgaacc gaacccagtc tgcacctttg cctcagagca cgttggctca 2340
gctggtcatt caacagcaac accagcaatt cttggagaag cagaagcaat accagcagca 2400
gatccacatg aacaaactgc tttcgaaatc tattgaacaa ctgaagcaac caggcagtca 2460
ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg atgcaggaag acagagcgcc 2520
ctctagtggc aacagcacta ggagcgacag cagtgcttgt gtggatgaca cactgggaca 2580
agttggggct gtgaaggtca aggaggaacc agtggacagt gatgaagatg ctcagatcca 2640
ggaaatggaa tctggggagc aggctgcttt tatgcaacag cctttcctgg aacccacgca 2700
cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg gttggcatgg atggattaga 2760
gaaacaccgt ctcgtctcca ggactcactc ttcccctgct gcctctgttt tacctcaccc 2820
agcaatggac cgccccctcc agcctggctc tgcaactgga attgcctatg accccttgat 2880
gctgaaacac cagtgcgttt gtggcaattc caccacccac cctgagcatg ctggacgaat 2940
acagagtatc tggtcacgac tgcaagaaac tgggctgcta aataaatgtg agcgaattca 3000
aggtcgaaaa gccagcctgg aggaaataca gcttgttcat tctgaacatc actcactgtt 3060
gtatggcacc aaccccctgg acggacagaa gctggacccc aggatactcc taggtgatga 3120
ctctcaaaag tttttttcct cattaccttg tggtggactt ggggtggaca gtgacaccat 3180 
ttggaatgag ctacactcgt ccggtgctgc acgcatggct gttggctgtg tcatcgagct 3240 
ggcttccaaa gtggcctcag gagagctgaa gaatgggttt gctgttgtga ggccccctgg 3300 
ccatcacgct gaagaatcca cagccatggg gttctgcttt tttaattcag ttgcaattac 3360 
cgccaaatac ttgagagacc aactaaatat aagcaagata ttgattgtag atctggatgt 3420 
tcaccatgga aacggtaccc agcaggcctt ttatgctgac cccagcatcc tgtacatttc 3480 
actccatcgc tatgatgaag ggaacttttt ccctggcagt ggagccccaa atgaggttgg 3540 
aacaggcctt ggagaagggt acaatataaa tattgcctgg acaggtggcc ttgatcctcc 3600 
catgggagat gttgagtacc ttgaagcatt caggaccatc gtgaagcctg tggccaaaga 3660 
gtttgatcca gacatggtct tagtatctgc tggatttgat gcattggaag gccacacccc 3720 
tcctctagga gggtacaaag tgacggcaaa atgttttggt catttgacga agcaattgat 3780 
gacattggct gatggacgtg tggtgttggc tctagaagga ggacatgatc tcacagccat 3840 
fc <; agaagcct gtgtaaatgc ccttctagga aatgagctgg agccacttgc 3900 
agaagatatt ctccaccaaa gcccgaatat gaatgctgtt atttctttac agaagatcat 3960 
tgaaattcaa agtatgtctt taaagttctc tggatccggt accagattac aaggacgacg 4020 
atgacaagta gatcccgggt ggcatccctg tgacccctcc ccagtgcctc tcctggcctt 4080 
ggaagttgcc actccagtgc ccaccagcct tgtcctaata aaattaagtt gcatcatttt 4140 
gtctgactag gtgtcctcta taatattatg gggtggaggg gggtggtatg gagcaagggg 4200 
cccaagttgg gaagacaacc tgtagggcct gcggggtcta ttcgggaacc aagctggagt 4260 
gcagtggcac aatcttggct cactgcaatc tccgcctcct gggttcaagc gattctcctg 4320 
cctcagcctc ccgagttgtt gggattccag gcatgcatga ccaggctcag ctaatttttg 4380 
tttttttggt agagacgggg tttcaccata ttggccaggc tggtctccaa ctcctaatct 4440 
caggtgatct acccaccttg gcctcccaaa ttgctgggat tacaggcgtg aaccactgct 4500 
cccttccctg tccttctgat tttaaaataa ctataccagc aggaggacgt ccagacacag 4560 
cataggctac ctgccatggc ccaaccggtg ggacatttga gttgcttgct tggcactgtc 4620 
ctctcatgcg ttgggtccac tcagtagatg cctgttgaat tgggtacgcg gccagcttct 4680 
gtggaatgtg tgtcagttag ggtgtggaaa gtccccaggc tccccagcag gcagaagtat 4740 
gcaaagcatg catctcaatt agtcagcaac caggtgtgga aaagtcccca ggctccccag 4800 
caggcagaag tatgcaaagc atgcatctca attagtcagc aaccatagtc ccgcccctaa 4860 
ctccgcccat cccgccccta actccgccca gttccgccca ttctccgccc catggctgac 4920 
taattttttt tatttatgca gaggccgagg ccgcctcggc ctctgagcta ttccagaagt 4980 
agtgaggagg cttttttgga ggcctaggct tttgcaaaaa gctcctcgag gaactgaaaa 5040 
accagaaagt taattcccta tagtgagtcg tattaaattc gtaatcatgg tcatagctgt 5100 
ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa 5160 
agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac 5220 
tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg 5280 
cggggagagg cggtttgcgt attgggcgct cttccgcttc ctcgctcact gactcgctgc 5340 
gctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat 5400 
ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca 5460 
ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc 5520 
atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc 5580 
aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg 5640 
gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcaatgc tcacgctgta 5700 
ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg 5760 
ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac 5820 
acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag 5880 
gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga agaacagtat 5940 
ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat 6000 
ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc 6060 
gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt 6120 
ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct 6180 
agaccctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaaactt 6240 
ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc tgtctatttc 6300 
gttcatccat agttgcctga ctccccgtcg tgtagataac tacgatacgg gagggcttac 6360 
catctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct ccagatttat 6420 
cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca actttatccg 6480 
cctccatcca gtctattaat tgttgccggg aagctagagt aagtagttcg ccagttaata 6540 
gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg tcgtttggta 6600 
tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc cccatgttgt 6660 
gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag 6720 
tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg ccatccgtaa 6780 
gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc 6840 
gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat agcagaactt 6900 
taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg atcttaccgc 6960 
tgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcttca gcatctttta 7020 
ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca aaaaagggaa 7080 
taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat tattgaagca 7140 
tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag aaaaataaac 7200 
aaataggggt tccgcgcaca tttccccgaa aagtgccacc tgacgcgccc tgtagcggcg 7260 
cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt gccagcgccc 7320 
tagcgcccgc tcctttcgct ttcttccctt cctttctcgc cacgttcgcc ggctttcccc 7380 
gtcaagctct aaatcggggc atccctttag ggttccgatt tagtgcttta cggcacctcg 7440 
accccaaaaa acttgattag ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg 7500 
tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg ttccaaactg 7560 
gaacaacact caaccctatc tcggtctatt cttttgattt ataagggatt ttgccgattt 7620 
cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat tttaacaaaa 7680 
tattaaacgt ttacaattt 7699 



<210> 15 

<211> 7303 

<212> DNA 

<213> Homo sapiens 

<400> 15 

cccattcgcc attcaggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc 60 
tattacgcca gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccca 120 
gggttttccc agtcacgacg ttgtaaaacg acggccagtg ccaagctgat ctaatcaata 180 
ttggccatta gccatattat tcattggtta tatagcataa atcaatattg gctattggcc 240 
attgcatacg ttgtatccat atcataatat gtacatttat attggctcat gtccaacatt 300 
accgccatgt tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt 360 
agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg 420 
cgaccgccca gcgacccccg cccgttgacg tcaatagtga cgtatgttcc catagtaacg 480 
ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg 540 
gcagtacatc aagtgtatca tatgccaagt ccgcccccta ttgacgtcaa tgacggtaaa 600 
tggcccgcct agcattatgc ccagtacatg accttacggg agtttcctac ttggcagtac 660 
atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta caccaatggg 720 
cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg 780 
agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaataa ccccgccccg 840 
ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcgttta 900 
gtgaaccgtc agaattcaag cttgcggccg cagatctatc gatctgcagg atatcaccat 960 
gcacagtatg atcagctcag tggatgtgaa gtcagaagtt cctgtgggcc tggagcccat 1020 
ctcaccttta gacctaagga cagacctcag gatgatgatg cccgtggtgg accctgttgt 1080 
ccgtgagaag caattgcagc aggaattact tcttatccag cagcagcaac aaatccagaa 1140 
gcagcttctg atagcagagt ttcagaaaca gcatgagaac ttgacacggc agcaccaggc 1200 
tcagcttcag gagcatatca aggaacttct agccataaaa cagcaacaag aactcctaga 1260 
aaaggagcag aaactggagc agcagaggca agaacaggaa gtagagaggc atcgcagaga 1320 
acagcagctt cctcctctca gaggcaaaga tagaggacga gaaagggcag tggcaagtac 1380 
agaagtaaag cagaagcttc aagagttcct actgagtaaa tcagcaacga aagacactcc 1440 
aactaatgga aaaaatcatt ccgtgagccg ccatcccaag ctctggtaca cggctgccca 1500 
ccacacatca ttggatcaaa gctctccacc ccttagtgga acatctccat cctacaagta 1560 
cacattacca ggagcacaag atgcaaagga tgatttcccc cttcgaaaaa ctgcctctga 1620 
gcccaacttg aaggtgcggt ccaggttaaa acagaaagtg gcagagagga gaagcagccc 1680 
cttactcagg cggaaggatg gaaatgttgt cacttcattc aagaagcgaa tgtttgaggt 1740 
gacagaatcc tcagtcagta gcagttctcc aggctctggt cccagttcac caaacaatgg 1800 
gccaactgga agtgttactg aaaatgagac ttcggttttg ccccctaccc ctcatgccga 1860 
gcaaatggtt tcacagcaac gcattctaat tcatgaagat tccatgaacc tgctaagtct 1920 
ttatacctct ccttctttgc ccaacattac cttggggctt cccgcagtgc catcccagct 1980 
caatgcttcg aattcactca aagaaaagca gaagtgtgag acgcagacgc ttaggcaagg 2040 
tgttcctctg cctgggcagt atggaggcag catcccggca tcttccagcc accctcatgt 2100 
tactttagag ggaaagccac ccaacagcag ccaccaggct ctcctgcagc atttattatt 2160 
gaaagaacaa atgcgacagc aaaagcttct tgtagctggt ggagttccct tacatcctca 2220 
gtctcccttg gcaacaaaag agagaatttc acctggcatt agaggtaccc acaaattgcc 2280 
ccgtcacaga cccctgaacc gaacccagtc tgcacctttg cctcagagca cgttggctca 2340 
gctggtcatt caacagcaac accagcaatt cttggagaag cagaagcaat accagcagca 2400 
gatccacatg aacaaactgc tttcgaaatc tattgaacaa ctgaagcaac caggcagtca 2460 
ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg atgcaggaag acagagcgcc 2520 
ctctagtggc aacagcacta ggagcgacag cagtgcttgt gtggatgaca cactgggaca 2580 
agttggggct gtgaaggtca aggaggaacc agtggacagt gatgaagatg ctcagatcca 2640 
ggaaatggaa tctggggagc aggctgcttt tatgcaacag cctttcctgg aacccacgca 2700 
cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg gttggcatgg atggattaga 2760 
gaaacaccgt ctcgtctcca ggactcactc ttcccctgct gcctctgttt tacctcaccc 2820 
agcaatggac cgccccctcc agcctggctc tgcaactgga attgcctatg accccttgat 2880 
gctgaaacac cagtgcgttt gtggcaattc caccacccac cctgagcatg ctggacgaat 2940 
acagagtatc tggtcacgac tgcaagaaac tgggctgcta aataaatgtg agcgaattca 3000 
aggtcgaaaa gccagcctgg aggaaataca gcttgttcat tctgaacatc actcactgtt 3060 
gtatggcacc aaccccctgg acggacagaa gctggacccc aggatactcc taggtgatga 3120 
ctctcaaaag tttttttcct cattaccttg tggtggactt ggggtggaca gtgacaccat 3180 
ttggaatgag ctacactcgt ccggtgctgc acgcatggct gttggctgtg tcatcgagct 3240 
ggcttccaaa gtggcctcag gagagctgaa gaatgggttt gctgttgtga ggccccctgg 3300 
ccatcacgct gaagaatcca cagccatggg gttctgcttt tttaattcag ttgcaattac 3360 
cgccaaatac ttgagagacc aactaaatat aagcaagata ttgattgtag atctggatgt 3420 
tcaccatgga aacggtaccc agcaggcctt ttatgctgac cccagcatcc tgtacatttc 3480 
actccatcgc tatgatgaag ggaacttttt ccctggcagt ggagccccaa atgaggttcg 3540 
gtttatttct ttagagcccc acttttattt gtatctttca ggtaattgca ttgcaggatc 3600 
cggtaccaga ttacaaggac gacgatgaca agtagatccc gggtggcatc cctgtgaccc 3660 
ctccccagtg cctctcctgg ccttggaagt tgccactcca gtgcccacca gccttgtcct 3720 
aataaaatta agttgcatca ttttgtctga ctaggtgtcc tctataatat tatggggtgg 3780 
aggggggtgg tatggagcaa ggggcccaag ttgggaagac aacctgtagg gcctgcgggg 3840 
tctattcggg aaccaagctg gagtgcagtg gcacaatctt ggctcactgc aatctccgcc 3900 
tcctgggttc aagcgattct cctgcctcag cctcccgagt tgttgggatt ccaggcatgc 3960 
atgaccaggc tcagctaatt tttgtttttt tggtagagac ggggtttcac catattggcc 4020 
aggctggtct ccaactccta atctcaggtg atctacccac cttggcctcc caaattgctg 4080 
ggattacagg cgtgaaccac tgctcccttc cctgtccttc tgattttaaa ataactatac 4140 
cagcaggagg acgtccagac acagcatagg ctacctgcca tggcccaacc ggtgggacat 4200 
ttgagttgct tgcttggcac tgtcctctca tgcgttgggt ccactcagta gatgcctgtt 4260 
gaattgggta cgcggccagc ttctgtggaa tgtgtgtcag ttagggtgtg gaaagtcccc 4320 
aggctcccca gcaggcagaa gtatgcaaag catgcatctc aattagtcag caaccaggtg 4380 
tggaaaagtc cccaggctcc ccagcaggca gaagtatgca aagcatgcat ctcaattagt 4440 
cagcaaccat agtcccgccc ctaactccgc ccatcccgcc cctaactccg cccagttccg 4500 
cccattctcc gccccatggc tgactaattt tttttattta tgcagaggcc gaggccgcct 4560 
cggcctctga gctattccag aagtagtgag gaggcttttt tggaggccta ggcttttgca 4620 
aaaagctcct cgaggaactg aaaaaccaga aagttaattc cctatagtga gtcgtattaa 4680 
attcgtaatc atggtcatag ctgtttcctg tgtgaaattg ttatccgctc acaattccac 4740 
acaacatacg agccggaagc ataaagtgta aagcctgggg tgcctaatga gtgagctaac 4800 
tcacattaat tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg tcgtgccagc 4860 
tgcattaatg aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg 4920 
cttcctcgct cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc 4980 
actcaaaggc ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt 5040 
gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc 5100 
ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa 5160 
acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc 5220 
ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg 5280 
cgctttctca atgctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc 5340 
tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc 5400 
gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca 5460 
ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact 5520 
acggctacac tagaagaaca gtatttggta tctgcgctct gctgaagcca gttaccttcg 5580 
gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt 5640 
ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct 5700 
tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga 5760 
gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa 5820 
tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac 5880 
ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga 5940 
taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata ccgcgagacc 6000 
cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg gccgagcgca 6060 
gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc cgggaagcta 6120 
gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct acaggcatcg 6180 
tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa cgatcaaggc 6240 
gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg 6300 
ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca ctgcataatt 6360 
ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac tcaaccaagt 6420 
cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca atacgggata 6480 
ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt tcttcggggc 6540 
gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc actcgtgcac 6600 
ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa 6660 
ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata ctcatactct 6720 
tcctttttca atattattga agcatttatc agggttattg tctcatgagc ggatacatat 6780 
ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc 6840 
cacctgacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg 6900 
tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc 6960 
tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg gggcatccct ttagggttcc 7020 
gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacgta 7080 
gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta 7140 
atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc tattcttttg 7200 
atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg atttaacaaa 7260 
aatttaacgc gaattttaac aaaatattaa acgtttacaa ttt 7303 



<210> 16 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 16 

ccatggaaac ggtacccagc aggc 24 

<210> 17 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 17 

cactccatcg ctatgatgaa ggg 23 

<210> 18 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 18 

agttcccttc atcatagcga tgg 23 

<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 19 

aatgtacagg atgctggggt 20 

<210> 20 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 20 

cccttgtagc tggtggagtt ccctt 25 

<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 21 

tgtgtcatcg agctggcttc 20 

<210> 22 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 



<400> 22 

atcttctgca agtggctcca 20 